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Abstract. We calculate the multiplicity function of mat- 
ter condensations with given mass and defined by an ar- 
bitrary density threshold which can be a function of the 
mass of the objects, by directly considering the actual, 
deeply non-linear density field, which we compare to the 
popular Press-Schechter approximation. This comparison 
is made possible owing to an analytic description of the 
non-linear mass distribution, but also thanks to a modcl- 
lization of the evolution of the two-body correlation func- 
tion from its linear to non-linear behaviour based on Pee- 
ble's spherical collapse picture. We show the mass func- 
tion is a function of a unique parameter that contains 
all the dependence on the mass and radius of the object. 
We compare this new, still simple, analytic model to a 
more involved formulation that should be even closer to 
the results obtained using the standard density threshold 
algorithms. This gives some hindsight into the "cloud-in- 
cloud" problem, for both Press-Schechter and non-linear 
prescriptions, and it enables us to derive the mass func- 
tion of general astrophysical objects (in addition to just- 
virialized halos) which may be defined by any density 
threshold, that may even vary with the mass of the object. 
This is beyond the reach of usual formulations based on 
the initial gaussian field and gives a clear illustration of 
the advantages of our approach. 

We explain why numerical tests seem to favor both 
Press-Schechter and non-linear prescriptions even though 
the two approximations differ by their scaling as a function 
of mass as well as a function of redshift. We argue that 
numerical simulations will be closer to the non-linear pre- 
dictions and should be reexamined in the light of our find- 
ings. The difference between the two is due to the fact that 
the Press-Schechter prescription assumes that present-day 
mass fluctuations can be recognized in the early linear uni- 
verse, and that their number is conserved, while the non- 
linear approach takes into account their evolution which 
leads to an increase of the number of highly non-linear 
objects (very large or small masses). 

This difference is seen to be of the same magnitude as 
the difference obtained by varying the initial spectrum of 
density fluctuations. Earlier conclusions drawn about the 



relevance of the latter using analytical approximations to 
the mass function must thus be reexamined. 
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1. Introduction 

There are traditionnally two ways to characterize the quite 
irregular distribution of matter (and light) at extragalac- 
tic scales. The first is to count galaxies of given luminosity 
to define a luminosity function. Since Press & Schechter 
(1974) and Schechter (1976) it is well accepted that the 
latter is a low luminosity diverging power-law with a sharp 
cut-off at the bright end. A related choice is to count very 
massive clusters (Abell 1958) or somewhat looser associa- 
tions of galaxies (Turner and Gott 1975). This allows one 
to determine a multiplicity function^ that is an abundance 
of objects as a function of mass, or luminosity, or any 
other parameter used to define the objects. An interesting 
similarity can be found (Bahcall 1979) in the shape of all 
these distributions. 

The same clustering properties may be described by 
measuring the galaxy two-body correlation function. As 
may be expected from the existence of dense galaxy clus- 
ters, the systematic Lick survey, that allowed the first sta- 
tistically significant studies, showed (Groth and Peebles 
1977) this correlation function to be quite large, the three- 
body correlation being even larger. Obviously, one may 
then suspect the many-body correlations to be even larger. 
These correlation functions may be determined by divid- 
ing the universe into cells, as the statistics of the counts- 
in-cells is intimately related to these correlation functions 
at high order (White 1979). Whether galaxies or the mass 
contained in the cells is considered, the theoretical evalua- 
tion of these counts shows, astonishingly (Schaeffer 1985, 
Balian and Schaeffer 1989, hereafter BS89) a diverging 
power-law behaviour for small counts, with an exponen- 
tial cut-off, quite similar to the one observed for the mul- 
tiplicity functions. This behaviour, specifically, is related 



2 



Valageas & Schaeffer: The Multiplicity Function of Galaxies, Clusters and Voids 



to the very large correlations -and whence fluctuations- of 
the counts at the deeply non-linear scales. 

With the availability (Peebles 1982) of the first real- 
istic initial spectrum of fluctuations, in the sense that it 
is indeed related to physical processes in the primordial 
universe, the Press-Schechter (PS) approximation was re- 
vived (Schaeffer and Silk 1985) to usefully supplement nu- 
merical simulations (Davis et al 1985). Such an approach, 
however, was violently questioned (Bardeen et al 1986), as 
were different other alternative solutions to the problem 
of defining analytically a mass function, because of the 
cloud-in- cloud problem. Indeed, a given density threshold 
defines an object. But, because the density is fluctuat- 
ing, this object may contain smaller substructures with 
larger ovcrdcnsitics surrounded by undcrdcnsc regions so 
that the full larger object still is at the required density. 
When the importance of over and underdense regions is 
evaluated, as required in the PS approach, using the gaus- 
sian initial conditions that are assumed to prevail in the 
early universe, this problem turns out to be serious. The 
users of this approximation, however, have argued that 
this approximation is much better than what it stands 
for, namely the true count of objects defined by a given 
overdensity, and provides for the correct qualitative -if not 
quantitative- multiplicity of condensed objects. Specific 
tests (Efstathiou et al 1988) against numerical simulations 
show that the mass function using this approximation and 
the computed ones follow similar trends, but (Eke et al 
1996) in some cases they compare well, and in some cases 
not at all. The PS approach, nevertheless, remains the 
most popular way to calculate analytically the mass func- 
tion (White 1993). 

We however think the situation is quite unsatisfactory. 
The main reason is that from counts in cells, we see that 
the matter distribution at scales, say, below 10 Mpc at 
least, is wildly modified by non- linear evolution. This has 
been amply checked (Alimi et al. 1990; Maurogordato et 
al. 1992, Benoist et al. 1996; Bouchet et al. 1993) against 
the CfA, SSRS or IRAS maps of the galaxy distribu- 
tion, as well as (in great detail) against numerical simula- 
tions (Bouchet et al. 1991; Bouchet and Hernquist 1992; 
Colombi et al. 1992,1994,1995,1996). The dependence of 
the counts on scale (according to the power-law be- 
haviour of the correlation function) and on time (accord- 
ing to the (1 + z)^*^"*^^) dependence of the latter) follows 
the noM-imear predictions. Were the PS formulation exact, 
the mass multiplicity function would scale as the linear 
correlation function extrapolated to the epoch under con- 
sideration (and in particular to the present one). Although 
such a feature would not be impossible, such a behaviour 
is a priori unlikely, given what we know from the counts- 
in-cells. Also, some of the detailed numerical tests to see 
whether not only the result but also whether the assump- 
tions made in deriving the PS formula are correct are far 
from being successful (White 1993, sect 2.3.6). This calls 
for a reexamination of the problem of the mass function 



and its calculation by analytical means, in the light of the 
experience acquired in understanding the behaviour of the 
counts-in-cells. 

In this paper, we construct the mass function at a 
given epoch from the non-linear density field at the same 
epoch, whose statistical properties we know. We present 
our formulation in Ch.2 . We discuss the various possible 
definitions of the mass function, how they are related to 
the hierarchical clustering pictures, their advantages and 
drawbacks. The distribution of underdense regions (with a 
low matter density) and of galaxy voids (which implies the 
proper inclusion of discreteness effects) can be obtained 
by the same methods. The result will be that, as it was 
the case for the density fluctuations, the mass multiplicity 
function can be described by i) a scaling function H(x) 
that may depend on the initial conditions but does not 
evolve with time in the fully non-linear regime, and ii) the 
time-dependent non-linear two-body correlation function. 
The important many-body correlations are taken into ac- 
count by the former scaling functions, that depend on a 
variable x close to the internal velocity dispersion of the 
object. This function turns out to exhibit a power-law 
for small x and an exponential fall-off for large x. The 
time evolution of the two-body correlation function can 
be modelled (Ch.3) by a totally analytic calculation using 
an extension, based on the spherical collapse picture, of a 
formulation due to Hamilton et al (1991). In Ch.4 we first 
compare numerically the predictions obtained using the 
PS prescription and our new model based on non-linear 
clustering, which differ significantly. We finally push the 
theoretical comparison as far as possible within our knowl- 
edge of the dynamics of clustering, the difference in the 
two approaches being reduced to one key assumption on 
the evolution of the number of mass condensations as the 
universe goes from the linear to non-linear regimes. In the 
last section (Ch.5) we present the mass functions of gen- 
eral astrophysical objects (in addition to the usual just- 
virialized halos) defined by an arbitrary density threshold 
which can be a function of the mass of the objects. 

As a conclusion, we propose several tests by means of 
computer simulation that may be performed, as well as 
the possible applications of the method. 

2. The multiplicity function: non-linear approach 

Provided one is able to describe the density fluctuations 
in the non-linear regime, it is possible to deduce the mul- 
tiplicity function from the known distribution of overden- 
sities above a given threshold A in cells of size R. The 
density fleld to be considered is, obviously in this case, 
the one relevant to the epoch at which the mass function 
is seeked. It is done in the same spirit as the very popular 
PS approach, (rephrased -App.A- in a way that parrallels 
the present non- linear description) , but amounts to count 
the overdensities directly where they are, in the non-linear 
regime, rather than trying to identify the future overden- 
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sities within the linear fluctuations in the early universe. 
The properties, outlined by BS89, of the non-linear den- 
sity field needed to our purpose are reviewed in App.B. 

2.1. Counts-in- cells 

The knowledge of the two-body correlation function and 
the theoretically predicted scaling of the counts-in-cells, 
that has been seen to hold at the present epoch and that 
we assume to hold also in the past, allows us to compute 
the probability P{5) to have a cell of density 5, for a given 
partition of the universe in cells of size R, at an epoch 
when the matter distribution is non-linear. 

As seen in App.B, the fraction of matter in cells with 
overdensity larger than A is, provided the mild constraint 



1 + A»^ 



-w/(l-a;) 



, that is a; ^ ^ 



is satisfied, 



For a matter correlation function ^(i?) = {R/Rq)~'^ , 
the mass function is a power-law ri{M) dM/M oc 
M" dM with a slope somewhat steeper than a = —1 
at small masses 

M <^M^ : r){M) oc M'^+'^t/^, 

that is a = -2 + uj'y/S . At M » M*, it bends down and 
decreases exponentially 

M > M» : r]{M) oc m-^+^^+'^'^'t/^ exp[-{M/M^y/^] 
where 

= (l + A)-(3/7-i) ^3/^ Mo 

and 



F„(>A)= / {1 + S)P{S)d6 :^ / xh{x)dx (1) = Po 47r/3 i?o =i 3 10 Mq, 

JA Jx{R) 



with x{R) = 



1 + A 

m 



and we can identify the fraction of matter n{R)dR/ R in 
overdense cells truly at scale R with the derivative of 
F„(> A,R) : 

KR)^=-^F^{>A,R)dR = xhix)^dR (2) 

With M = {1 + A)yOoV^, where V is the volume of the 
elementary cell, we can take as a possible definition for 
the mass function 



its precise mimerical value depending on the value of O 
and the matter correlation length Rq. 

So, with x^ ~ 10, for A ~ 200, a typical density con- 
trast for clusters, it bends at M* ~ 4 10^'' M©, whereas 
for A ~ 5000, characteristic of a galaxy, the bend indeed 
is at M* ~ 5 1O^^M0, a typical mass for an galaxy. 

The behaviour of the mass function is displayed on 
Fig.l for two different values of A. 



, , ^, dM po , , , , 



with x{M) 



1 ■ A 

mm] 



(3) 



which implies a well-defined scaling of the mass function 
function of the threshold A: different choices of A 
lead to self-similar distributions. 

Bernardeau and Schaeffer (1991) tested successfully 
against observations some of the consequences of the scal- 
ing implied by (3), by comparing galaxy and cluster multi- 
plicities. This approximation to the multiplicity function 
as well as several other possibilities with the same scal- 
ing had been discussed earlier by BS89. Among all these 
early attempts, the first one is, in our mind, the only one 
that should be kept. Our approach in particular has the 
advantage of obeying the normalization condition 
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A = 5000 \ 
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f — 77(M)— = / xh{x)dx = 1 
Po M Jo 



(4) 



since M = corresponds to i? = and ^ = +oo, that is 
a; = 0, whereas M = +oo corresponds to x = -|-oo : all the 
matter and not just half of it, as would be the case using 
linear theory in the PS framework, is within the condensed 
objects. 



-4 -2 

Fig. 1. Behaviour of the mass function r]{M) dM/M. 
The graph shows Vo r]{M), where Vb = 47r/3 R^, with 
u) = 1/2 , = 5 , 7 = 1.8, in the cases A = 5000 (dashed 
line) and A = 200 (solid line). The former is relevant to galax- 
ies, whereas the latter describes galgixy clusters. The difference 
in M, indeed corresponds to the difference between the typical 
mass of a galaxy and a galaxy cluster. 



As explained in App.B, the scaling with h{x) holds pro- 
vided the correlation function is much larger than unity, in 
the stable clustering regime. For values of ^ ~ 1 the non- 
linear behaviour slowly transforms into the linear gaus- 
sian distribution, but only for overdensities A smaller than 
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unity. For the larger A relevant to our approach since we 
count real clusters that already exist, and not protoclus- 
ters that will potentially bcconic non-linear, we enter the 
quasi-gaussian regime (Bernardeau 1992, 1994). The gaus- 
sian develops an exponential tail that has just the same 
scaling with A as the tail of h{x) but with a different 
function h. We could interpolate between the two regimes 
in the way suggested by Colombi et al (1996). However, 
for ^ > 10, the difference is minor. When ^ = 1, we have 
X = {1 + A)/^ 1 and we are already very far in the 
exponential tail of h{x), so there are practically no more 
clusters at this scale and the precise form of h{x) is unim- 
portant. For most practical cases we will not go beyond 
these regimes, and will use the non-linear h{x) through- 
out. However, for ^ < 10 and moderate values of 1 + A, 
such an interpolation would have to be considered. 

2.2. Surroundings of a cell 

To define overdensities non-linear at scale R, another pos- 
sibility is to consider explicitely the surroundings of each 
cell. The joint probability P{S, R; S', R + dR) (or simply 
P((5, 5') ), for a cell of scale R to have a density 5 whereas 
for a slightly larger scale R -\- dR the density is 5' , is con- 
structed for the non-linear case in App.C. 

The mass fraction in cells with overdcnsity larger than 
A in R, but smaller than A in i? -|- di?, is then 

Fm{> A,R;<A,R + dR) = fm{> A, < A; R)dR 



oc /* A 

d6 / d6' (1 + 6) P{6, 6') 

A J-1 



= (1 + A) P{> A,< A) 



(5) 



and is of order dR. This calls for another possible defini- 
tion of the mass function 



^(M)^ = ||/^(> A,<A-R)dR 



(6) 



which can be usefully compared to the previous definition. 

Clearly, if the latter makes any sense and indeed de- 
scribes overdensities A with mass M, the two definitions 
should not be very different. The rather technical -but 
badly needed- calculations used to compare both mass 
functions are given in App.C. The difference between the 
two approximations, evaluated in the non-linear case con- 
sidered here, also gives some information on how the mass 
function depends on the way in which the objects are de- 
fined: directly as overdensities surrounded by underdensi- 
ties, as considered here, or more implicitely by the number 
of overdense cell of mass M compared to the number as- 
sociated to an infinetesimally larger mass as considered 
above. 

As shown in App.C, it is quite interesting to note that 
ri{M) defined in this section obeys the same scaling re- 
lation as in the previous case. In particular, the trend of 



the mass function described above, a power-law at small 
masses and an exponential fall-off at large masses, is ex- 
actly the same. Indeed, we have 



dM 



Po 



M M 



xH{x)dx 



(7) 



with a function H{x) that differs from h{x). However, as 
shown in App.C, H{x) and h{x) have in general the same 
behaviour oc x^~'^ at small x, with a different normaliza- 
tion but the same value of w, and ~ exp(— .x/a;*) at large 
X. Explicit models for h{x) show they may not differ by 
more than 10 per cent, and we have the general bounds 
(see (C19) and (C20)): 

a; > : < -7--^ < - and X » a;* : 1 < -7^ < -(8) 

h[x) 7 h[x) 7 

So, the two definitions for the mass multiplicity functions 
we have retained are quite similar. The function h{x) is 
by definition properly normalised, but H{x) is not nec- 
essarily: as readily seen in practical implementations of 
an agorithm that recognizes condensations by means of a 
density threshold, there is some overcoimting (this is the 
cloud-in-cloud problem) in this case. This is actually in 
favour of the definition (3) of section 2.1 which is simpler 
and has the interesting property that the total mass frac- 
tion within the objects is exactly unity by construction. 

2.3. Comparison with the PS approach 

The same overcounting encountered for the mass frac- 
tion i-ixii') defined in this way when overdensities are 
evaluated in the linear regime with gaussian probabilities 
(App.A) is much more severe, since in this case, overden- 
sities with a given threshold are counted infinitely many 
times at different scales. Indeed, the total mass counted at 
various scales sums up to infinity. This is due to the fact 
that for small scales E 00, hence the density contrast 
averaged around a given point shows wide fluctuations 
(^ S) as the "smoothing" scale R varies and it crosses 
the threshold Sc many times, whence a large overcount- 
ing. This problem was addressed by Cole (1989), Bond et 
al.(1991), who calculated at each scale R the mass which 
lies above Sc at a larger scale. The derivative gives then 
the mass fraction which lies above 5c at a scale between R 
- R + dR, and is below this threshold at all larger scales, 
which solves the "cloud-in-cloud" problem. Using a top- 
hat in k for the window function they were able to show 
that the mass function obtained in this way is simply the 
PS result multiplied by the usual factor 2. However, this 
window function is quite peculiar as in this case the incre- 
ment of the density at a given scale is statistically indepen- 
dent of the larger scales, which may modify qualitatively 
the "cloud-in-cloud" problem. 

It is clear that the mass function (6) described above, 
based on P(> Sc, R; < Sc, R+dR), overestimates the mass 
fraction at scale R since one should rather evaluate, as 
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Cole (1989), P(> 5c,R;< (5c, Vi?' > R + dR) which is 
free of the "cloud-in-cloud" problem. At the small mass 
end, the mass function obtained from (6) is a very bad 
approximation and it leads to a total mass which is infi- 
nite, showing indeed that the "cloud-in-cloud" problem is 
severe at these scales and may lead to a substantial mod- 
ification of the PS result. On the other hand, we find that 
for a window function which is a top-hat in i? or a gaus- 
sian (or any window function which is differentiable in k) 
it tends to the Press-Schechter result without the factor 2 
at the large mass end. This shows that the usual multipli- 
cation of this latter mass function by an overall factor of 
2 to get the right normalization cannot be justified by the 
excursion set results, for realistic window functions dif- 
ferent from the top-hat in k. In fact, taking into account 
the "cloud-in-cloud" problem implies to multiply the PS 
mass function by a scale-dependent renormalisation fac- 
tor, smaller or equal to unity for large u. 

Even if this geometrical problem were solved, one 
would still face the problem of the modellization of the 
dynamics one has to assign to these patches of matter to 
derive the properties of the non-linear density field ob- 
tained later and to get the number of virialized halos. 
Moreover, it is clear from the previous discussion that the 
number of large mass virialized objects obtained with the 
use of the common spherical collapse dynamics would un- 
derestimate the results of numerical simulation by ^ 50%, 
since to match roughly these latter results one needs to 
multiply the PS mass function by at least the usual factor 
2. This casts some serious doubts on the validity of this 
approach. 

In the non-linear case, the dynamics which governs 
structure formation, from the initial gaussian universe to 
the final highly non-linear density field, is partly encap- 
sulated into the scaling function h{x) (or the coefficients 
Sn)- At this stage, h{x) cannot be predicted from the ini- 
tial conditions, and has to be measured in numerical sim- 
ulations. However, generic predictions can be made from 
the sole assumption of stable clustering, that is the scale 
invariance of i^(y), see App.B., which can be applied next 
to any case provided one knows h{x). Note moreover that 
the measure of /i(.t) needs only to be done once, if the 
scale-invariance holds. Then it will apply for any time or 
scale, in the highly non-linear regime. The advantage of 
this approach is that all predictions are directly derived 
from the real non-linear density field, and not from the 
initial gaussian conditions extrapolated by a highly un- 
certain dynamical model which leads to some problems 
(normalization, proper calculation of the mass function). 
Moreover, we showed in the previous section that, as an 
extra-bonus, some of these geometrical problems (over- 
counting) encountered for a gaussian field do not lead to 
very serious difficulties when one works directly with the 
actual highly non-linear density field. Indeed, the peaks 
are well separated as the fraction of volume they occupy 
tends to and not 1/2 as in the gaussian case, when the 



scale tends to 0, see (B22) and (B24). Thus, the same mea- 
sure of the overcounting due to the cloud-in-cloud problem 
in the non-linear case shows this problem not to be a seri- 
ous difficulty when the overdensities are directly counted 
in the non-linear regime, contrary to the counting based on 
initial linear gaussian fluctuations. This suggests strongly 
that the mass functions constructed from (2) or (6) are 
very close and provide a reliable approximation. 

We can note that in our definitions (2) and (6) for the 
mass function, what we actually consider is the proportion 
of matter formed by particules such that each of them is 
at the center of a sphere of radius R with density contrast 
A. Then we identify this with the mass fraction embedded 
within virialized halos of scale R. Thus, the "objects" iden- 
tified in such a way are not necessarily spherical, although 
for large x one may expect them to have increasingly 
spherical shapes. At first glance, an alternative method to 
count "actual" halos would be to start from an unknown 
distribution of spherical halos ri{M, R) dM/M dR/R of 
size R and mass M, with a given density profile, and de- 
rive from this the probability distribution of the counts- 
in-cells. Then, one gets an integral relation between both 
quantities which one may try to inverse within certain ap- 
proximations. However, we think that such a procedure 
cannot give very satisfactory results as it neglects sub- 
structures and density fluctuations within halos, which 
play a major role in the properties of the density field and 
are necessary if the scaling invariance of the coefficients Sn 
is to be realised. An obvious way to see this, is to note that 
within such a picture of a collection of regular halos with 
a well-defined density profile, the mass which lies above a 
certain density contrast A tends to zero for very large A, 
whatever small the "smoothing" radius R might be. This 
is clearly in contradiction with (B23), which shows that 
(within the framework of our description of the non-linear 
density field) for any threshold A one counts nearly all 
the matter for a sufficiently small smoothing radius. As 
a consequence, the definitions (2) and (6) seem the most 
reasonable choices. 

2-4-. Underdense regions 

It is possible to define underdense regions in a way similar 
to the one we have used to define clusters. Let us define 
a given density threshold (1 — A) below which a region 
will be considered as underdense. It is more sensible in 
this case to count the volume occupied by the underdense 
cells. There arc, again, two definitions that may be used to 
define the number of such regions of a given size R. Note 
that the latter scale is the true scale of the underdense 
volume. 

The volume fraction occupied by the underdense cells 
in a partition of the universe into cells of scale R is 

Fy{<-A,R) = J P{5)d5 (9) 
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For (1 — A) <g; which for large ^ is true for nearly any 
value of (1 — A) that is smaller than unity, obeys the 
scaling that results from the considerations of App.B 



F^{<-A,R)= / g{z)dz 

'0 



(10) 



with 



^ = (1 - A) a- 



-l/(l-a.) 



w/(l-w) 



that is, it does depend on only one scaling variable, and 

not on the two quantities A and R separately. So the vol- 
ume fraction of cells truly at scale between R and R + dR 
is 

m = -^Fv{< -A,R) dR = -g{z)dz 

and the number density of underdense regions of scale R 

or volume V is then 



dR 1 , ^ , 

Vu{R) -^ = - Y 9{z) dz 



(11) 



The normalization properties of the function g{z) then 
show that the total volume fraction of these low-density 
regions is equal to unity 



/•OO JD POO 

F, = V r,u{R) ^ = I 9{z) dz = 1 



(12) 



as expected since underdense regions at the non-linear 
scales occupy nearly all the volume, although all the mass 
is in the overdense regions. 

For reasons explained in the next section, testing in 
galaxy catalogues the scaling with g{z) in the underdense 
regions is difficult because of discreteness effects, but it can 
be done (Alimi et al. 1990, Maurogordato et al. 1992) and 
works within the limits of the accuracy of the data extrac- 
tion from catalogues. This scaling is seen to work beauti- 
fully in numerical simulations (Bouchet et al. 1991). There 
are however conditions of validity (App.B. 3. 3b). First, at 
large scales, the g{z) and h{x) scalings merge to rebuild 
the gaussian behaviour. However, when {1 — 6) <^ 1, the 
dominant contribution to P{S) comes from the asymptotic 
region of (p{y) and the results of App.B. 3. 3b are valid. At 
small scales, the condition (B18) which reads (1 — A) ^ ^ 
is verified as soon as ^ ^ 1. Hence the number density 
of voids is described, by g{z) at all scales, provided that 
(1 — A) ^ 1. However, the scaling functions g{z) describ- 
ing small scales (i? ^ i?o) in the highly non-linear regime, 
and large scales (i? ^ i?o) in the linear regime are different 
(as the corresponding functions h{x) and (^(j/) also vary 
from the quasi-gaussian regime to the highly non-linear 
regime). 

At small scales, z is large, and g{z) is a power-law. 
Whence 



(i-A)»r"^''" 



At large scales, where z is small, g{z) decreases expo- 
nentially and we have, provided ^ is still a power-law 

(1-A)«e""^''""^ : r?„(i?) oc i?-3+'^(i+-)/(2-2-) 

X exp[-a;(l - a;)^a- (1 - A)"^ (i?/i?o)^] 

Thus the multiplicity function of underdense regions 
is steadily decreasing, and is a power-law with a cut at 
a scale close to the correlation length i? ~ i?o where the 
averaged matter correlation function is unity, as we can 
see on Fig.2. 



\og[VoVu{R)] 










^^v^^- . 1 




! 




1 \ 
1 \ 
1 \ 

^0 \ 




\og{R/Ro) \ 



-1.5 



-0.5 







Fig. 2. Behaviour of the multiplicity function of underdense 
regions ??«(-R) dR/R predicted in the non- linear approach. 
The graph shows Vb r)u{R) with Vb = 47r/3 Rq in the case 
w = 1/2, 7 = 1.8, A = 0.9. 



As for the over densities, there is another possible def- 
inition for these underdense regions, that stems from the 
consideration of the surroundings of a given cell. The prob- 
ability for a cell to be underdense at scale i?, and not at 
scale R+dR, is of order dR and may be identified with the 
volume fraction occupied by underdense regions of scale 
R: 

P{< -A, R; > -A, R + dR) = /„(< -A, > -A; R)dR 



-r>i: 



dS I dS' P{S, 5') 

-1 J -A 



(13) 



The relation with the previous definition can be inferred 
from the idendity 



P(< - A, > - A) = P(< - A, R) - P{< - A, R + dR) 
+P{> -A,<-A) 

The number density of underdense regions is then 



(14) 



Vu{R)^ = /«(< -A,> -A;i?) dR 



(15) 
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that scales with a function G{z) which has the same be- 
haviour, except for a normalization constant, as the func- 
tion g(z) wc introduced previously (sec App.C and (C28)). 
Note that because of (14) we have G{z) > g{z), and 
J^Vr]y{R)dR/R > 1. This is natural, since such a pre- 
scription docs not solve the "cloud-in-cloud" (or "void- 
in- void") problem entirely, and one may count the same 
volumes several times, but since G{z) and g{z) have the 
same behaviour and wc expect V r]y{R) clR/R to be 
close to 1 (see App.C and (C28)), it is an acceptable so- 
lution to this problem. 

2.5. Galaxy voids 

A continuous density field may be sampled by elementary 
points with number density n. All the above considera- 
tions are valid provided the number of elementary points 
per cell is large. This condition, at the non-linear scales, 
translates into (1 - A)nyJ(i?)-'^/(i-'^) » 1. When this 
quantity is of order unity, or smaller, discreteness effects 
become increasingly important. This is most often the 
case when the galaxy distribution is considered. Indeed, 
for typical values A = 0.9, n = 10~^ Mpc~^,a; = 0.5,7V,; 
is smaller than unity at all scales below 8 Mpc, that is at 
all non-linear scales. Incidentally, note that the counts of 
the overdensities using the continuous density field con- 
cepts that we have presented above, on the other hand, 
require (1 -|- A)nF^(i?) to be large. For A = 9 and the 
same galaxy number density, this is the case, provided 
one is not too demanding, at nearly all interesting scales 
provided R > 0.3 Mpc. 

It is possible to extend the above considerations about 
the void distribution to the discrete case by considering 
the probability P{N,R), or simply P{N), for counting N 
galaxies in a cell of scale R. A partition of the universe 
gives a volume fraction of empty cells 

F,{R) = P(0, R) = eM-vWc{RMm (16) 
with 

Nc{R) = nVl{R) 

The fraction of cells corresponding to voids at scale R and 
not above is then —d/dR [Fy[R)] dR and the number of 
such cells gives the number of voids 



(17) 



The alternative definition obtained by considering the 
edge of an empty cell implies to construct the probability 
P(0, R;> Q,R + dR) for having nothing in the cell of size 
R and at least one point (or one galaxy) in the larger cell 
of scale R + dR. Since 



the two possible definitions are in this case the same: ob- 
viously there is no "void- in- void" problem if we consider 
regions truly empty of galaxies. The multiplicity function 
in this case is also a power-law with an exponential cut, 
as we can see on Fig. 3, but with exponents that bear no 
relation with the continuous case (see Fig. 2 ). Moreover, 
the cut is at Ry and not Rq. 



log[3iVo] 








1 


^(x 7?-"(3-t) 




1 
1 






1 

Ro 




















log(7?/7?o) \ 



-2 -1 

Fig. 3. Behaviour of the void multiplicity function 

rjviR) dR/R. The graph shows Vb VviR) in the case 
w = 1/2 , 7 = 1.8 , iVo = 21. 



We define the quantities: 
Vo = 47r/3 Rl, No=nVo,N = nV , 



Rv — Rq 



l/(3-F7w/(l-w)) 



With n = 10~^ Mpc^-^ , Rq = S Mpc, we have Nq ~ 21. 

At small scales, where ^{Nc) N^^l, the behaviour 
is dominated by Poisson statistics 



R<^ Rc : VviR) - 3n 

At intermediate scales, where <f{Nc)/^ 
it transforms into a power-law: 



N, 



l-Ul 



« 1, 



Rc<^ R<^ Rv ■■ Vv{R) oc R~ 



.(3-7) 



while at large scales, where (p{Nc)/£, 
decreases as 



M-'^ > 1, it 



P(0, R;>(),R + dR) = P(0, R) - P{0, R + dR) 



(18) 



R:^ Ry : f]y{R) cx i^-'^fs-T) cxp[-{R/ Ry)^~'^^^~'^'>] 

The scale Ry may thus be interpreted as the typical size 
of voids. It depends on the number density n and on 
the correlation length Rq, that explicitely are functions of 
the magnitude limit of the galaxy sample considered. The 
value of Ry and its variations according to the magnitude 
limit of the sample, and the associated scaling of P(0, R), 
have been extensively discussed by Maurogordato et al. 
(1992), and follow very closely the theoretical predictions. 
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3. Evolution of the correlation function 

To get the non-linear mass function at any time t once we 
are given an initial spectrum of the density fluctuations, 
or alternatively the correlation function of the present uni- 
verse, we need to evaluate the correlation function at this 
time t. The conservation of pairs of particules gives an ex- 
act equation for ^(.x, a) ( Peebles, LSS), from which Hamil- 
ton et al (1991) followed by Padmanabhan (1996), argued 
that ^{x,a) should depend simply on the linear correlation 
function ^^(xz,, a) evaluated at a different spatial point xl 
but at the same time t or expansion factor a= {1 + z)~^: 



i{x,a) = F (,L{xL,a) 
l+e(a;,o) 

where x and xl are comoving lengths and 



(19) 



£,{x,a) 



)y^dy 



(20) 



Such a dependence has been checked in numerical sim- 
ulations by Hamilton ct al.(1991), Jain et al.(1995) and 
Padmanabhan (1996), which showed that the function F 
had also a small dependence on the form of the initial 
spectrum of the density fluctuations, that we shall, in a 
first step, ignore here. However, in all cases F{(.l) — £,l 
for small |_l and -F(Cl) « for large when | > 200. 

The reason why the Hamilton et al.(1991) approach 
works can be traced back to the hierarchical clustering pic- 
ture (Peebles, LSS) based on the spherical collapse model. 
In the linear regime, a density contrast S within a comov- 
ing radius xl evolves as (5 oc a. As a consequence we have 
^j^{xL,a) = < (5^ > oc a^. If we note J^o{xl)^ the initial 
correlation function calculated at a mass scale such that 
Rm = Xl and extrapolated to the present universe by the 
linear theory, we have: 



(21) 



A density contrast ^ a 'Eq(xl) will reach unity at a 
time given hy unl — '^o{xl)~^ and the radius of the 
overdensity at this time is r^vL — '^o{xl)^^ .t^. Its density 
at the time of non-linearity is of the order of the universe 
density at this epoch njvz, ~ n6(tArL). In the regime where 
clustering is stable, the excess of the number of neighbours 
within the blob over the mean is, as a function of time. 



riNL miaNL) 



nb{a) 
whence 



nb{a) 



a-NL 



(22) 



(23) 



In both regimes ^ is a function of = ^o{xl)- The 
non-linear correlation function ^ is taken at a comoving 
length x given by the conservation of the number of pairs 
since there is a similar conservation law (Peebles, LSS) for 
^ which relates linear and non-linear scales. In the linear 



regime, where ^ <C 1, we have ^ ~ and 



XL- In 



— — 3/2 

the non-linear regime, ^ > 200, we have C ~ ■Cl and the 

1/3 

lenght scales are linked bya;~a;z,ajvL/a~a;z,^ 
Hence we recover the behaviour given by (19), so we write 
a similar relation for ^: 



(24) 



'£,{x,a) = F \^L{xL,a)] 

xl = [l+'^{x,a)] x^ 
If we consider the physical length r we get, using (21), 

l{r,z) = F{{l + z) 



-2v2 
^0 



which is an implicit equation for ^(r, z 
lations (Jain et al.l995) show that 



(l + z)r[l + ^(r,z)]i/3]} (25) 
Numerical Simu- 



la 



.?» 200:?^(i£)^?r 



(26) 



where a ~ 1, see App.E. 



The function F can be obtained from numerical sim- 
ulations. It may also be modellized in the spirit of the 
spherical collapse picture detailed above if we know i) the 
relation between ^ and 5, which changes from ^ ^ 5^ in 
the linear regime to ^ ~ (5 in the non-linear case, and ii) 
the evolution of S, which can be traced using the sphc;ric;al 
collapse model (App.D) according to the ideas underlying 
the above qualitative picture. Such a model is presented 
in App.E, based on the sphc;ric;al collapse picture. It is 
calibrated on the numerical results obtained by Jain et 
al.(1995), and it allows one to get a relation for the non- 
linear correlation function in the case of an open universe 
n < 1 , A = which is consistent with the behaviour seen 
by Peacock & Dodds (1996) in simulations. Note that in 
this case the simple relation — C is no longer valid and 
one gets ^ = F (^^,0). More precisely, we have: 



^ « 1 : C ^ $L 

?» 200:e^(3^)'ef' 



(27) 



where d{Q) oc D{t)/a (with d ^ 1 for f2 ^ 1 and t 0), 
see App.E. Prom this model for the evolution of ^, and the 
scaling fimction h{x) measured in simulations, we can now 
obtain the mass functions of any astrophysical objects at 
all times, in the highly non-linear regime. We shall first 
compare in more detail in the next chapter this approach 
with prescriptions based on an extrapolation from the ini- 
tial gaussian field. 
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4. Comparison of the linear and non-linecir ap- 
proaches 

4-1- Counts-in- cells implied by the PS approximation 

The hierarchical picture based on the spherical collapse we 
developped in the previous chapter to get the evolution of 
the correlation function can also give the counts-in-cells, 
in the same spirit as the PS derivation seen in App.A. 
Indeed, in both cases the fundamental hypothesis is that 
one can recognize in the early universe the overdensities 
which will eventually form clusters, or more precisely one 
assumes that it is possible to follow the evolution of each 
initial overdensity (or of most of them). The dynamics 
of these objects is close to the one given by the spheri- 
cal model and. more importantly, tlicy keep their identity 
throughout their evolution: there is no redistribution of 
mass. Of course, a given overdensity will be part of differ- 
ent virialized objects as time goes on, but neighbours re- 
main close to each other, so that mass is conserved by the 
dynamics. As we recalled in App.A, this picture of gravita- 
tional clustering leads to the PS mass function (A9). The 
PS prescription also implies a prediction of the counts-in- 
cells in the non-linear regime, that may be usefully com- 
pared to the one seen in numerical simulations, as we will 
undertake now. 

4.1.1. Density field implied by the PS and stable clustering 
approximations 

It is possible to define at any time an effective density 
field that, by the same counting done by PS in the linear 
regime and done by us in the non-linear regime, would at 
any time give the number of overdensities implied by the 
PS approach. If the latter is exact, this effective density 
field should be the same as the non-linear density field. 
Obviously, this cannot be the case as all density fluctua- 
tions cannot simultaneously follow a spherical dynamics, 
and the behaviour of underdensities is known to be quite 
inacurate, but we shall consider overdense regions which 
are usually assumed to be rather well modelled by such 
an approach and examine the consequences which can be 
inferred from such a picture. 

Similarly to (24) and (26) we get from simply applying 
the spherical collapse rules that an overdensity Sl at a 
comoving scale Rm transforms into an overdensity 5 at 
scale R: 



^Rl, = {1 + 6)R^ 

where we considered the case f2 
dependence), and satisfies: 

\Sl\ « 1 : H^l) ^ Sl 

5^ » 1 : :F{5l) ^ iff 61 



(28) 



= 1 (no explicit time 



(29) 



Note that we took a = 1, as we consider here the evolu- 
tion given by the usual spherical collapse model, with no 
kinetic energy at the time of maximum expansion, in or- 
der to follow closely the usual PS approximation. Within 
this framework, the mass fraction above a given density 
contrast 5 at scale R is: 



F„(> 6,R) = F„o(> J'-\S),Rm) 



(30) 



where Fmo{> Sl,Rm) is relative to the early universe, 
that is to the linearly extrapolated values. Of course, this 
formulation gives back the PS prescription when overden- 
sities are counted directly in the non-linear density field 
constructed in this way. 



Fm{> A^,R) = Fmo{>Sc,RM) with (1 + A^) i?^ 



-flM 



where 4 = 3/20 {nirf/^ ~ 1.68 and 1 4- = IStt^ are 
the density contrasts at the time of collapse given by the 

usual spherical model. Then, the mass fraction in the form 
of just virialized objects (i.e. with a density contrast Ac) 
is: 

l^{M)^ = -^F^{>K,R)dR 



d 



■■Pmo(> 5c,RM)dRM 



ORm 

which is just the PS mass function. We can write (30) as: 

/.F-i(5)/E(ii„) 



POO 

Fm{>5,R) = 



2-K 



(31) 



The mass fraction in cells of scale R with a density con- 
trast 6 to 6 -\- d6 is: 



(1 + 5) Pps{6) dS = --^^ F„(> 6, R) d6 



(32) 



defining Pps{S) as the approximate probability distribu- 
tion of the density contrast within cells of size R implied 
by the PS approach. Thus we obtain for the counts-in- 
cells: 



PpsiS) 



1 



1 d 



2^1-\-5dd 



T-\5) 



^(Rm) 



exp 



Naturally, by construction we recover all the mass of the 
universe /(l-h(5) Pps{5) dS = 1 (indeed :r-i(-l) = -oo). 
In fact, half of the mass is in overdensities and the other 
half in underdensities, since overdensities (underdensities) 
remain overdensities (underdensities) forever {!F{0) = 0). 
However, in general this probability distribution is not 
correctly normalized: / Pps{6) dd ^ 1 (but in the linear 
regime, ^ — > and S 0, its normalization tends to 
unity). This is linked to the fact that the Press-Schechter 
approximation (using the usual spherical dynamics) can- 
not describe underdensities (which are generally taken into 
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account in an ad- hoc fashion by the factor 2). In the 
regime \6\ <^ 1, we have J^~^{S) ~ d, since S ~ 6l, and 
Rm — R, so we recover the gaussian: 



\S\ « 1 : PpsiS) 



277 



(33) 



For large overdensities 5 > Ac wc saw that J^(Sl) ~ 
iff 51, so T-^{5) ~ 3/10 5^/^. We consider the case 
where the power-spectrum is a power-law: P{k) oc fc" and 
S(Z)2 oc Then we have: 



3 5(5+n)/6 

Y.[{1 + 5Y/^R] ~ To 11{R) 

We can also get the correlation function ^,{R) at the scale 
R from (26), which leads to: 

S(i?) = ^ |(i?)(5+")/*5 when e>Ac 



Hence we get: 
<5l 



V = 



(5+n)/6 



S(i?M) S[(l + (5)l/3i?] 

and eventually, for ^ > Ac and 6 > Ac : 



P (x\ 1 1 5 + r 



X exp 



(n-7)/6 



2a^ (?) 



(5+n)/3' 



(34) 



We can note that we recover exactly the form (BIO) we 
had for the non-linear prescription, with the same param- 
eter x: 



1 



6 



Pps{^) = Z2 hps{x) with X = = 



(35) 



since in the regime of large overdensities we consider here 
(1 -|- ^) ~ 5. Hence the probability distribution exhibits 
the same scaling law with 



27r 6a 



This is due to the fact that the usual Press-Schechter scal- 
ing parameter v can be expressed as a function of the sole 
non-linear scaling parameter x: 



a 



(37) 



and the mass fraction in objects within a given range of 
mass is simply: 



M(M) 



dM 



1 



27r 



v e 



-1/2/2 '^^ 2 , . . dx 
' — =x^ hps{x) — 



(38) 



The existence of the relation i^ — x is made possible thanks 
to two effects: i) in this range ^ ~ ^+ so ^ and ^ follow 
the same evolution, and ii) the power-spectrum is a power- 
law. We can notice that these functions hps{x) satisfy the 
normalization 



j X hpsix) dx = - 



(39) 



Naturally, this expresses again that overdensities only con- 
tain half of the total mass, as we saw above. The nor- 
malization of hps{x) is "correct" in this sense, although 
(35) is valid only for 5 > Ac, because all overdensities will 
eventually reach and grow beyond this threshold. Hence at 
large times the formulation (35) is valid for all overden- 
sities, except a vanishing fraction. To cure this problem 
we multiply arbitrarily the PS formula by a normalization 
factor of 2, a procedure that, as discussed earlier, is not 
justified. In this case the mass function is assumed to be 
given by the function 2 hpsix). 

4.1.2. Moments implied by the PS approximation 

Thus, the effective PS density field has been seen in the 
previous section to scale in the same way as the actual non- 
linear field (for large overdensities), but to differ quanti- 
tatively as is readily shown by the normalization prob- 
lem. When the PS mass function is multiplied by the 
usual (but rather arbitrary) factor 2, we can calculate the 
scale- invariant coefficients Sp implied by the above density 
field since all the mass is now described by the function 
2 hps{x). Then we can compare them with the values 
obtained in numerical simulations to describe the differ- 
ence between these two density fields. Indeed, using (B13) 
where we replace h{x) by 2 hps{x) we get: 



6p - 



1 



2{5 + n) 



(40) 



1 , 6P+.1-1 

p>l : Sp = ^= (2a2)WTT 

Naturally 5i = 1 since we normalized the mass function 
to unity, but generally we get 5*2 ^ 1. This contradiction 
(since / x"^ h{x) da; = 1 by definition of ^) means that one 
cannot simply multiply the PS mass function by a factor 2 
to get an acceptable mass function, as the additional con- 
straint 52 = 1 has to be fulfilled. Thus the PS density field 
is not only inexact for the overall density it predicts in the 
clustered regions, but also for the correlation function it 
implies (after correction by the factor 2) which is slightly 
different from the actual non-linear correlation function. 
This shows very clearly that the PS formula, even with a 
corrected normalization, cannot describe exactly the mass 
function of non-linear objects, and does not give the ex- 
act value of the parameter a. Naturally, it is possible to 
"correct" the evolution of individual overdensities by a pa- 
rameter a' , as we did for ^, such that ^2 = 1, which means 
that we would get the correct two-body correlation func- 
tion ^. However, such a refinement is certainly illusory, as 
such a simple model is probably too crude to be cured 
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significantly by such a small correction. For completeness, 
table 1 gives the coefficients 5p, for p < 5, obtained in this 
way, compared with those given by numerical simulations 
from Colombi et al. (1995). 



cutoff of hps{x) becomes even sharper, which encreases 
even more the difference with the actual scaling function 
h{x). 



Table 1. Coefficients Sp from the density field implied by the 

PS prescription and from numerical simulations (in parenthe- 
sis), for various indexes n of the power-spectrum. Note that 
52 = 1 by definition. In the PS case, has been readjusted to 
unity; without this modification, all values of Sp given by this 
model should be divided by 2. 



n 


52 


53 


54 


55 





1.25 


2.78 


8.31 


30.3 




(1) 


(3.54) 


(19.9) 


(158) 


-1 


1.10 


2.62 


9.10 


40.5 




(1) 


(5.62) 


(56.2) 


(891) 


-2 


1.19 


4.23 


25.2 


209 




(1) 


(11.2) 


(316) 


(12589) 



We can see that the coefficients Sp we obtain are quite 
far from those given by numerical simulations. This means 
that the PS mass function must differ appreciably from 
the actual one, and that a correction by a factor a' to get 
^2 = 1 cannot be sufficient. 

4.1.3. Numerical comparison of the non-linear and the PS 
scaling functions 

Indeed, the functions hps{x) we get from (36) have a dif- 
ferent shape from those we considered in the context of 
the non-linear approach. We still have a power-law multi- 
plied by an exponential cutoff, but this cutoff is no longer 
a simple exponential but the exponential of a power-law. 
In terms of the function tp{y) it means that for n > —2 
the singularity ys is repelled to infinity ys = —oo while for 
n < — 2 it goes to 0: ys = 0. However, in the case n = —2 
we get: 



n = — 2 : hps{x) 



1 



2.18\/27r 



c/2.38 



(41) 



which corresponds to w = 1/2 , = —1/2 and x* = 
2.38. We can note that this value of x^, is much smaller 
than the value a;* ~ 18 obtained by Colombi et al. (1996) 
in numerical simulations. In fact, the functions h{x) we 
get from (36) differ from those foimd by Colombi et al. 
(1996) by several orders of magnitude after the cutoff, see 
Fig. 4. We can note that for all these power-spectra, the 
functions h{x) have a shallower power-law for small x, and 
a smoother cutoff at large x. Moreover, if we "correct" 
hps{x) by a factor a' > 1 so that 52 = 1, the exponential 



\og[x''h{x)] 


^^^\ 


■ / n = 1 


\ 

'i\>\\ 

'"\'.\ \ 
" I. \ \ 


^1* 


log(a;) ;'\\\ \ 



-2 2 

Fig. 4. Scaling functions h{x) given by numerical simulations 
(solid line) and the PS prescription (dashed-line), in the cases 
n = —2, n = —1 and n = 0. We multiplied the PS result, 
eq.(36), by the usual factor 2 so that the normalization is the 
same for both functions. The exponential cutoff of h{x) gets 
sharper as n increases. 



Fig. 4 shows the scaling functions h{x) given by nu- 
merical simulations and the PS prescription, for different 
indexes of the power-spectrum. The difference between the 
two approaches is of the same order as the difference ob- 
tained by varying the initial spectrum of density fluctua- 
tions. This implies that conclusions derived for the latter 
using the PS formulation should be reexamined. 

4.1.4. Discussion of the PS approach 

The important differences which we have seen to exist be- 
tween the non-linear and the PS based formulations cast 
some doubts on the latter. Indec;d, the fact that the high 
tail of the functions h{x) we get is not consistent with 
what is observed, that we do not recover the power-law 
h{x) oc x'^^'^ for small overdensities in accordance to the 
condition (BIO), because of (33), that the coefficients Sp 
are far from those obtained by numerical simulations, and 
that half the mass remains locked in underdensities, show 
that there is some redistribution of mass during the pro- 
cess of gravitational clustering, which prevents one to fol- 
low the evolution of a given overdensity which will lose 
its identity sooner or later. Besides, the spherical collapse 
model supplemented by virialization itself may be inade- 
quate. Moreover this cannot be cured by a simple change 
of normalization, since the shape itself of the probabil- 
ity distribution has to be modified, even for large density 
contrasts. Hence it would seem qiiite surprising that the 
PS formulation predicted accurately the results of gravita- 
tional clustering. It is actually quite simple to understand 
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why the non-linear distribution (Fig. 4) is systematically 
broader than the PS one: assuming that every initial over- 
density evolves independently amounts to assume the ex- 
istence of conservation laws. Relaxing the latter constraint 
increases the available phase-space and thus yields broader 
distribution. Whence we expect this to hold also for 17 < 1 
as well as for A 0. We can note that for n < — 2 this 
would imply that ys = 0, which is however a bit surprising. 

4.1.5. 0< 1 , A = 

For an open universe we now have an additional time de- 
pendence as compared to the previous case f2 = 1. Thus, 
the system (28) becomes: 



Rli = (l+(5)i?3 



(42) 



In a fashion similar to the study of the evolution of the 
correlation function, see (27) and App.E, we obtain the 
asymptotic behaviour of such that: 



■|,5i|«l : J'{5L,a)c^5L 



(43) 



Since in the virialized regime 5 and ^ follow the same 
behaviour the pre- factor d{Vi)~^ disappears in the calcu- 
lation, and we recover: 



u=- a;(5+")/<5 if ^, ^ > 178 d{^l)-^ 
a 



(44) 



as in the case $7 = 1. Since the mass fraction in objects 
within a given range of mass still verifies by definition the 
relation (38) we get the same scaling function hps{x) as in 
the case = 1. Hence the remarks we drew in the previous 
section can be readily extended to the case of an open uni- 
verse, with A = 0. Note however that, contrary to the case 
f2 = 1, all overdensities will not reach and grow beyond the 
threshold 5 > A.^. Hence, although J x hps{x) dx = 1/2 
some overdensities will never be described by the scaling 
function hps{x). The latter corresponds to overdensities 
such that 5l > 10 and 6 > Ac(17 = 1) d{n)-^. 

Thus, the scaling function hps{x) we obtain is inde- 
pendent of f7, as long as we restrict ourselves to scales 
(^) and objects (6) which collapsed while O ~ 1. This is 
due to our hypothesis of stable clustering, which implies 
that structures do not evolve once they are formed. This is 
consistent with the fact that the dependence on redshift of 
the correlation function ^ is entirely accounted for by the 
factor (i(f7), with a constant parameter a (see Chapter 3 
and App.E), as shown by numerical simulations (Peacock 
and Dodds 1996). 



4-2. Analytical comparison of the PS and the non-linear 
mass functions 

4.2.1. Direct comparison of analytical expressions 

Although the above considerations on the non-linear den- 
sity field that would give exactly the PS counts arc instruc- 
tive for understanding this approximatiom, there is a more 
direct way to compare it to the non-linear mass function, 
which needs only to assume that the model (Ch.3) for the 
evolution of the correlation function (which fits well into 
the picture given by the numerical simulations) is correct. 

Using the correspondance between linear and non- 
linear correlation functions ((24) and (26)) only, we can 
directly express (for $7 = 1 and z = 0) the ratio u = 
5c/'^{M) appearing in the PS mass function (A9) as a 
function of the parameter x of the non-linear formulation. 
Indeed, by definition of the usual PS formulation we have 



3/10 



(45) 



for just-virialized objects, since it assumes a = 1. Our 
model for the evolution of the correlation function leads 
to: 



E(M) = 3Q/10 (&-^ 
This gives 



(46) 



(47) 



Thus we obtain again a relation between the PS scaling 
parameter v and the parameter x = Ac/^. Since the mass 
fraction in objects within a given range of mass still verifies 
(38) we see that the usual PS mass function (A9) for just- 
virialized objects has the same scaling as the non-linear 
one (3) with the function hps defined in the previous sec- 
tion, and a scaling parameter x = (aj/)^/(^+"). 

This consideration is important as it directly compares 
the PS prescription, which evaluates the mass function of 
just-virialized objects (A9), to the same quantity given 
by the non- linear formulation (3). Thus it does not rely 
on the assumption of stable clustering for individual ha- 
los, as in the previous sections. Moreover, the fact that 
both scaling functions h{x) and H{x) are very close (see 
App.C) shows this comparison is valid although h{x) was 
originally obtained from the counts-in-cells. 

4.2.2. Broadness of the distribution 

We have seen in the previous section on Fig. 4 that the 
slope of h{x) for small x is smaller than the slope of 
hps{x), and that the large x cutoff should be smoother. 
We have also argued that this should probably be true for 
any power spectrum or cosmological parameters 17 , A. 
This broadening obviously is translated into the mass 
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functions calculated in the two approaches. It is also read- 
ily seen on the figure that the non-linear processes, which 
are certainly the main cause for the deviations of the ac- 
tual h{x) from the PS approximation (rather than the cor- 
rections due to the cloud-in-cloud problem, as discussed 
in section 2.3), tend to broaden the scaling fimction h(x) 
(or the corresponding mass function). The position of the 
maximum of x'^h{x) is always close to x ~ 1 because of the 
definition of x, since ^ ^ 5+ in the virialized regime where 
5+ is the characteristic overdensity of non-linear objects 
at the considered scale, by definition of ^ (see App.E). 

4.2.3. Earher discussions of the PS approach 

Blanchard et al.(1992) compare the actual mass function 
with the PS approximation. These authors assume that 
the fraction of mass embedded in non-linear objects of 
mass larger than M, defined in the present case by an 
actual density contrast 5 larger than a density threshold 
Ac, depends linearly on the initial conditions as: 

f°° dM f°° 

/x(M) -j^ = s{5L,y) F{u) du (48) 

The integral F{i^)d V is the fraction of volume, calcu- 
lated in the linear approximation^ in which each point can 
be embedded in a sphere such that its mean relative over- 
density (5 = vYj{M) satisfies \v\ > Vg, I](M) being the 
r.m.s. fluctuation of ^ at scale M . The kernel s(i5, v) is a 
selection function which gives the fraction of mass in be- 
tween (5 and 8 ^ db that ends up in a non-linear object of 
mass larger than M for a threshold v. They do not give s 
or F . but discuss the mass function according to various 
assumptions made for these functions. They use for F the 
usual gaussian expression obtained from linear the- 
ory (but however argue that, because of the cloud-in-cloud 
problem F(y) may be different). Taking for granted that s 
has a sharp threshold, they show that the non-linear mass 
function is exactly given by the PS expression (but again, 
discuss also other possibilities at the faint mass end). We 
have seen that the actual mass function differs sensibly 
from the PS approximation, contrary to the conclusion of 
Blanchard et al.(1992). This is because they only consider 
the possible variations of F{yy. the sharp threshold as- 
sumption for s, that seemed at first sight innocent, indeed 
implies a one-to-one correspondance between the initial 
overdensities and the actual non-linear ones, as can be 
seen by simple derivation of (48), in the same way as has 
been originally assumed in the derivation of the PS expres- 
sion. Their derivation that the non-linear mass function is 
equal to the PS mass function is thus not a prediction, 
but a consequence of their sharp threshold assumption. 

4.2.4. Non-linear scaling of the PS approach 

Since the PS approximation appears to be plagued by 
many problems, one may wonder why we recover the scal- 
ing law (35) for large densities. This is simply due to the 



fact that in such a model overdensities are "frozen" after 
virialization: their radius and their density do not evolve 
any longer so their density contrast grows as a^. Indeed, 
the many-body correlation functions ^„ are mainly sen- 
sitive to the rare high-density peaks, and increasingly so 
with larger n. Thus we may write ^„ ^< (5" >^ ^ 77^ 5" (x 
^3(ra-i) giniilarly to (El), where r]i is the fraction of vol- 
ume occupied by overdensities Si and rn oc since the 
radius of these overdensities remains constant. Then the 
coefl[icients Sn = are constants, which implies 

the scaling (35), as we recalled in Chapter 2 and App.B. 
As a consequence, the fact that overdensities "freeze" af- 
ter they reach the threshold is sufficient to get the 
correct scaling law for large overdensities. We can note 
that such an argument may also apply to low density uni- 
verses < 1 where structure formation stops after Q gets 
sufficiently small. In fact, the scale invariance of the coef- 
ficients S„, within the highly non- linear regime, could be 
an even better approximation in this case than for Q = 1 
where structure formation never stops. Thus, Padmanab- 
han et al.(1996) find in numerical simulations that the two 
point correlation function ^ satisfies the predictions of the 
stable clustering hypothesis with a better accuracy for an 
open universe than for a critical universe. 

4.2.5. Away from the deeply non-linear regime 

As we can check on Fig. 4, the non-linear approach gives 
more numerous large overdensities than the PS prescrip- 
tion. This holds as long as ^ > Ac at the considered scales. 
When ^ becomes lower than this threshold, it suddenly 
falls to get close to the linear value ^j^. This decrease in 
^ is quite fast, as in the usual sperical coUape model it 
corresponds to ^ decreasing from Ac ~ 178 to ~ 1 while 
goes from 6"^ ~ 2.8 to 1. Hence, for a given density 
contrast 5 the parameter x — S/^ suddenly rises, so that 
the mass function given by the non-linear approach de- 
creases sharply. Moreover, when ^ < 1 the function h{x) 
itself changes to the one givcm by the quasi-gaussian pre- 
diction, which is characterized by a stronger exponential 
cutoff (smaller ) . This enhances even further the sudden 
falloff of the number of these objects. We can also verify 
this effect if we perform a comparison between the non- 
linear and PS approaches in the regime ^ 1 and 5 > Ac. 
We now have: 

^^-(2+„)/6 3a^(5+„)/6 ^^g^ 

Thus there is no longer a scaling in x, except for n = — 2. 
However, even in this case the exponential cutoff is much 
smaller since now .t* = 2(10/30;)^ ~ 22. In fact, for 
n = —2 the quasi-gaussian function h{x) has ~ 4 (see 
Bernardeau 1994), so we see that the multiplicity function 
it predicts will suddenly fall when ^ ^ 1, even below what 
the PS prescription gives. This means that in the non- 
linear picture of structure formation through gravitational 
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clustering, objects appear suddenly at a redshift such that 
the two-points correlation function taken at their scale be- 
comes larger than unity. Thus the evolution we obtain is 
much more violent than what is implied by the PS ap- 
proximation: we get more objects in the recent past, but 
they suddenly disappear at a given redshift, which defines 
a well-determined redshift of structure formation. The lat- 
ter will depend on the astrophysical objects one considers 
(more precisely on their scale). 

4-3. Mass functions 

As we recall in App.A, the comoving mass fimction for 
virialized objects, at a redshift z, given by the PS pre- 
scription (which we multiply by the usual factor 2), is: 



r){M)- 



dM 



2 Po Sc{z) 



dlnSn 



dlnM 



M V TT M So 
where, following the notations of App.D 
= 1: 5c{z) = 5co (1 + z) 
and 



M ^ ' 



O < 1 , A = : 5,{z) = -Z?(to) 



1 



\ t{z) ) 



2/3' 



If the power-spectrum is a power-law: P{k) cx A;", we can 
write So(M) = (7o(M/Mo)"("+3)/^, where ctq is the nor- 
malization of So(M) at the mass Mq, or at the radius Rq. 
Since i?o is defined by ^(i?0 7 z = 0) = 1, we have ctq — 1, as 
we are still close to the linear regime so that ^{Rq) — Sg. 
If = 1, this leads to: 



V{M) — 



X exp 



ttMo 6 ao \Mo 



(1 + ^) 



dM 



(51) 



Hence, we can see that the mass function switches from a 
power-law to an exponential behaviour at M ~ M,,ps{z) 
with 

= 1: M,ps(0) = Mo (l + ^)-^/("+3) 
and more generally: 

'ScizY 



VO : M^psiz) = Mo 



5co 



-6/{n+3) 



On the other hand, the non-linear prescription we pre- 
sented in chapter 2 leads to the comoving mass function: 



viM)— = ^ X h(x) dx with x = -= — 



(52) 



where Ac{z) is the density contrast at the time of virializa- 
tion, at the redshift ^. If O = 1, Ac 178 is a constant. If 
we are in the regime of stable clustering with a power-law 
for ^, we have: 

x = [ii+A^izm+zrr'^' i^^y 

We define: 

M,{z) = [(1 + Ae(^))(l + zf] "^'/^"'^ xl^^ Mo 
Then, for small masses M <^ M, : 

,(M) cx (^) [(1 + Ae(.))(l + zf] 



and for large masses M ^ M* : 

'AT 



^(M) cx ( ^ V'^'^^'^d + A.)(l + .)f -*)(^-^) 



X exp 



[{i + A,{z)){i + zf]' /M_y/'' 

a;* \MoJ 



The mass function bends at M ~ M*{z). 

Hence both prescriptions give similar mass functions, 
a power-law multiplied by an exponential cutoff, but their 
redshift dependence is very different: the exponent of the 
redshift term in the exponential cutoff is 2 for the PS pre- 
scription, while it is (3 — 7) = 6/(5 -|- n) for the other mass 
function, when = 1. Indeed, in the case of stable cluster- 
ing the power- law of the non-linear correlation function is 
7 = 3(3 -I- n)/(5 -|- n). Thus, for a given mass the number 
density of halos will decrease much more slowly accord- 
ing to the non-linear prescription than the PS formulation 
would imply if the index n of the power spectrum verifies 
n > —2. In the particular case n = — 2, the dependence 
on redshift is the same for both prescriptions. Moreover, 
both cutoffs M^,{z) and M^,ps{z) show the same evolution 
with redshift. This is consistent with the results seen in 
sections 4.1. and 4.2., since we noticed that the mutiplicity 
functions can be written: 



dM Po j2 1^ dv 



' M M V TT 



(53) 



for the PS approximation (multiplied by the usual factor 
2), while 



, , dM Po 2 , r \ dx 
V{M)— = ^x^ h(x) 

\ I M ^ ' X 

for the non-linear prescription, with 
1/ oc a;(^+")/*' if = 1 



(54) 



(55) 
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Hence both mass functions must evolve in parallel, and 
their relative position does not change with time: a given 
value of X corresponds to a constant value of v whatever 
the redshift. The fact that the evolution with time of the 
mass functions evaluated for a given mass varies from one 
formulation to the other, is simply due to the fact that 
their slopes are different. 

In the case of a low-density universe, the relation v — x 
no longer holds, and the multiplicity functions given by 
both approximations have a different redshift evolution. 
Indeed, for small scales where clustering is stable (that is 
^ > nM{VLy^) we have for large times (fi < 1): 



log(C) 



u{M,z) 



2So(M) 

which is independent of z while: 
a;ocM(3+»)/(5+»)(i+2)4/(5+n) 



(56) 



(57) 



Thus, at large times (1 + z) and .x ^ for a 
fixed mass M and linear parameter v. This means that 
the small mass end of the multiplicity function given by 
the non-linear approach decreases with time relative to 
the PS prediction. However, in the intermediary regime 
where 1 <i; ^ ^ 178rf(f2)~^, one can show that within 
the description presented in App.E, the scale defined by 



V oc X 



3/2 D{t) verifies: 

(3+n)/6 



(58) 



Hence both multiplicity functions remain close to each 
other, and follow a similar evolution at these scales. This 
is natural, since both mass fractions have the same nor- 
malization (we must recover the average comoving density 
of the universe) and their peak is close to v = 1 and x = 1 
by definition of S and ^, which correspond to similar ob- 
jects because of the relation — ^. 

4-4- Numerical comparison of the mass functions 

We explained in the previous chapter how we evaluate the 
average two-body correlation function ^ at any redshift 
once we are given an initial power spectrum. Then, one 
has to "choose" a function h{x), which can for instance 
be obtained by a fit to the observed present mass function 
of clusters. The function h{x) has been obtained (Bouchet 
et al 1991) in a simulation with CDM initial conditions. 
In the case the primordial power-spectrum is a power-law, 
P{k) (X fc", Colombi et al. (1996) showed that different 
values of n led to different functions h{x) in the deeply 
non-linear regime. Hence, we can study the mass functions 
for various initial conditions using these simulations. 

The upper graph of Fig. 5 shows the non-linear correla- 
tion function ^ in the case n — —1, CI = I. We can see that 
it is very close to the observed power-law ^{R) oc R~^-^ in 
the range we consider presently. Fig. 5 also displays both 



\og{R/Ro) 




-6 -4 -2 

Fig. 5. Upper figure: The evolved correlation function $^{R,z) 
(solid line), and the linear extrapolation S(i?)^ (dot-dashed 
line), in the ease Q. — 1 , P{k) oc (n — —1). The short 
dashed line is the power-law (R/Rt))^'' with 7 = 1.8. Inter- 
mediate figure: Comoving mass function rj(M) dM/M. The 
graph shows the quantity Vb ri{M), for the non-linear prescrip- 
tion (solid line), with ui = 1/2 , = 5 , Ws = —1-1, and 
for the PS prescription (dashed line). The vertical solid lines 
show the position of M«, while the vertical dashed lines show 
the position of M^ps, for both redshifts. Lower figure: Mass 
fraction ^l{M) dM/M for the same case. 



mass fractions iJ,{M)dM/M (for the PS and non-linear 
prescriptions) in the same case n=— l,r2 = l, at the 
redshifts z = and z = 3. As we explained in the previ- 
ous section, both mass fractions follow the same evolution 
with redshift, and their relative position remains constant. 
Contrary to what could be inferred from section 4.1. and 
Fig. 4 the non-linear prescription docs not lead to many 
more halos than the linear formulation after the cutoff (at 
the large mass end). This is due to the fact that the analy- 
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Fig. 6. Upper figure: The evolved correlation function ^{R, z) 
(solid line), and the linear extrapolation E(Ji)^ (dot-dashed 

line), in the case = 1 , P{k) oc (n = ^2). 

The short dashed lines are the power-laws (R/Rq) ' with 
7 = 1.8 and 7 = 1.4. Intermediate figure: Comoving 
mass function r;(M) dM/M. The graph shows the quantity 
Vo ^/(Af), for the non-linear prescription (solid line), with 
ui — 0.3 , x„ — 18 , LJs = —1.6, and for the PS prescrip- 
tion (dashed line) . The vortical solid lines show the position of 
il/, , while the vertical dashed lines show the position of M,ps, 
for both redshifts. Lower figure: Mass fraction ^,{M) dM/M 
for the same case. 
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Fig. 7. Upper figure: The evolved correlation function ^(i?,2;) 
(solid line), and the linear extrapolation E(J?)^ (dot-dashed 

line) , in the case n = 1 , P(fc) oc fc° (n = 0). The short dashed 
line is the power-law (R/ Ro) ' with 7 = 1.8. Intermediate fig- 
ure: Comoving mass function ri{M) dM/M. The graph shows 
the quantity Vb ri{M), for the non-linear prescription (solid 
line), with w = 0.65 , = 2 , Ws = —0.575, and for the 
PS prescription (dashed line) . The vertical solid lines show the 
position of Af, , while the vertical dashed lines show the posi- 
tion of Mtps, for both redshifts. Lower figure: Mass fraction 
lx{M) dM/M for the same case. 



sis we developped above breaks down in this region where 

^ < Ac. Indeed, for these small values of ^ the clustering 

— — 3/2 — . 

is not stable: wc do not have ^ cx and ^ is smaller 

than the value it would have if this relation were still valid 
(as can be seen on Fig. 12 which clearly shows the sudden 
fallofF in the relation which translates into a similar 

break for ^(^2.))- This decrease in ^ implies an increase in 
X which leads to a stronger cutoff than the one implied 



by our previous analysis. The net result is that both mass 
fractions are very close in this domain, especially in the 
case n = — 1. This implies for instance that the multiplic- 
ity functions of clusters of galaxies given by both prescrip- 
tions will be very close. Naturally, although the relation 
j^" — a; we obtained in the strongly non-linear regime does 
not hold any longer for these weakly non-linear scales, the 
evolution with redshift of both mass fractions is still sim- 
ilar in this domain. Indeed, since the power spectrum is a 
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Fig. 8. Upper figure: The evolved correlation function ^{R, z) 
(solid line), and the linear extrapolation E(Ji)^ (dot-dashed 
lino), in the case = 1 for a CDM power-spectrum P(k). The 
short dashed line is the power-law (R/Rq) ' with 7 = 1.8. In- 
termediate figure: Comoving mass function rj{M) dM/M. The 
graph shows the quantity Vb ri{M), for the non-linear prescrip- 
tion (solid line), with uj = 0.4 , x* = 12.5 , ujs = —1.4, and 
for the PS prescription (dashed line). The vertical solid lines 
show the position of M, , while the vertical dashed lines show 
the position of M,ps, for both redshifts. Lower figure: Mass 
fraction fj,{M) dM/M for the same case. 
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Fig. 9. Upper figure: The evolved correlation function i^{R,z) 
(solid line), and the linear extrapolation S(i?)^ (dot-dashed 

line), in the case fio = 0.3 , A = , P{k) oc (n = -2). The 
short dashed line is the power-law {R/ R))^"' with 7 = 1.8. In- 
termediate figure: Comoving mass function rj(M) dM/M. The 
graph shows the quantity Vb ri{M), for the non-linear prescrip- 
tion (solid line), with ui = 0.3 , = 18 , Ws = —1.6, and 
for the PS prescription (dashed line). The vertical solid lines 
show the position of M, , while the vertical dashed lines show 
the position of M,ps, for both redshifts. Lower figure: Mass 
fraction /u(M) dM/M for the same case. 



power-law the shape of the mass fraction does not depend 
on redshift (for any formulation). As both mass fractions 
evolve in parallel in the highly non-linear end (which cor- 
responds to small scales, and in the present case to small 
masses), all of their features evolve in parallel. This also 
means that there is a relation v — x, independent of z, 
for all values of v and x. In fact, as wc noticed in 2.1., 
the non-linear scaling h{x) breaks down for small ^ where 
we enter the quasi-gaussian regime. We can note that for 



other astrophysical objects defined by larger density con- 
trasts, the analysis we developped in section 4.1. for the 
highly non-linear regime holds for large x and v. Thus, 
the shape of the mutiplicity functions given by both ap- 
proaches follows the comparison shown on Fig. 4, and the 
non-linear formulation will lead to many more extreme 
objects located beyond the cutoff than the PS extension. 
This will be the case for instance for galaxies, since the 
cooling constraint will imply a high density contrast for 
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bright galaxies at low redshifts. This point is developped 
in detail in Valageas & Schaeffer (1997). Hence, although 
both prescriptions may lead to very close results for some 
objects like clusters, they can differ widely for others. 

In the case n = — 2, = 1, the mass functions are 
different for nearly all masses. Fig. 6 shows the evolved 
correlation function, which can still be approximated by a 
power-law, but with a shallower slope 7 = 1.4. It also dis- 
plays both mass fractions n{M)dM/M at the same red- 
shifts z = Q and z = 3 as previously. As for the case 
n = — 1, both multiplicity functions are very close at the 
high mass end. 

Fig. 7 shows similar results in the case n = 0. 

Fig. 8 shows the evolution of the correlation function 
from 5](r)^ to ^(r) for a CDM power-spectrum in a critical 
universe. One can see that it is somewhere between the 
cases n — —\ and n = — 2 at the scales we are interested 
in. Since the power-spectrum is no longer a power-law in 
this case, following Jain et al.(1995) we define an effective 
index neff{z) to get the correction a{neff), such that: 



fd\-aP{k)\ 



with ^i(re) = 1 



= l/re 



Fig. 8 also shows the mass functions r]{M)dM/M and the 

mass fractions ii{M)dM / M in this case of a CDM power- 
spectrum with n = 1. As the power-spectrum and the 
correlation functions S(r) are no longer power-laws the 
shape of the curves fi(M)dM/M changes slightly with the 
redshift, as well as the relative position of both multiplicity 
functions. 

Finally, we consider the case fig ~ 0.3 , A = 0, which 
is shown on Fig. 9. We take the case n = — 2 as an example, 
which shows the qualitative similarity with the case O = 
1. As is well-known, the evolution with redshift of the 
multiplicity functions is slower than for a critical universe 
(compare to Fig. 6). As we explained above, both mass 
fractions do not follow exactly the same evolution, as was 
the case for 12 = 1, but they remain very close to each 
other around their maxima and their cutoff. Other initial 
spectra would show analoguous features. 

As a summary, the PS mass functions, which result 
from counting the overdensities in the linear regime with 
a prejudice for which ones are going to end up as clusters, 
and the non-linear mass functions, which directly count 
the overdensities at the epoch of interest, differ signifi- 
cantly, but somewhat less at the large mass end where oc- 
casionnally the two are identical. All examples given here 
were for overdensities of ~ 200 (and ~ 400 for Oq = 0.3), 
as is customary when describing "virialized" objects. To 
describe the mass function of galaxies (Valageas and Scha- 
effer (1997)), overdensities of ~ 10^ are required. In the 
latter case the difference between the PS and the non- 
linear mass function is much larger, especially at the large 
mass end. 



5. Other astrophysical objects 

So far, we have only considered the mass functions of 
dark matter halos defined by a constant density contrast 
threshold A. We mainly focused on A = Ac(z) which cor- 
responds to just virialized halos (according to the usual 
spherical collapse criteria), but our analysis performed for 
the non-linear density field (characterized by '•p{y) or h{x)) 
applies to any constant density contrast threshold, which 
allows one to study underdense regions too, as in section 
2.4. However, astrophyHic;al objects are not always defined 
by a constant density threshold. For instance, galactic ha- 
los also need to satisfy a cooling constraint (together with 
the usual virialization condition) which leads to a spe- 
cific relation A{R) (or A(M)) which defines the external 
boundary of these objects of a given mass (through the re- 
lation R{M) it implies), as is studied in detail in Valageas 
& Schaeffer (1997). The relation A(i?) is usually different 
from a constant: for instance in the case of galaxies one 
gets a constant density contrast at small masses (the usual 
virialization condition), which transforms into a constant 
radius at large masses (asymptotic cooling constraint at 
high temperature), which can be written as (1 + A) oc M. 
In a similar fashion, the modellization of Lyman-a clouds 
leads to a non-constant relation A(J?). As a consequence, 
it is important to look in detail what mass functions can 
be obtained when objects are no longer defined by a con- 
stant density threshold but by a more general relation 
A(i?) or A(M). This is the aim of this chapter, where 
we only consider the non-linear prescription based on the 
scale invariance of the function ip{y). Indeed, this is be- 
yond the reach of the "linear" formulation (which cannot 
model underdense areas) except in the domain where it 
can be recast in the "non-linear" terms with a scaling 
function 2 hps{x), as was developped in Chapter 4. 

5.1. Counts-in-cells 

We consider astrophysical objects defined by a function 
(1 -|- A)(i?), and we note: 



dln{l + A) 
dlnR 



(59) 



Hence the usual virialization condition corresponds to (3 ■ 
0, and a constant /3 means: 



if /? = constant : (1 A) oc R~^ , x (x R~ 



0+1 



(60) 



where we used ^{R) oc R~^ by definition of 7, and x = 
(1 -|- A)/^ is the usual non-linear parameter. 

We first consider the case /3 < 7. As we did in section 
2.1, we can define from the counts-in-cells the fraction of 
matter with an overdensity larger than (1-|-A)(i?) at scale 
R: 



Fmi> A{R),R) = / xh{x)d 

Jx{R) 



(61) 
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and use for the fraction of matter n{R)dR/R embedded 
in objects of scale R - R + dR: 

= -^Fm{> HR),R) dR = xh{x)^ dR (62) 
while the mass function is: 



local value of the function (1 + A)(i?) which has a different 
meaning. Hence we define the mass fraction by: 



^^{R) 



dR 
R 



d_ 
dR 



Fm{< A,i?) dR = -xh{x)dx 



while the mass function is: 



viM)j^ = ^xh{x)dx 



with 



M = (l + A)po^ (63) rj(M) — 



-^xh{x)dx 



(64) 



(65) 



Whatever the relation {1+A){R) this mass function is nor- 
malized to unity: / ii{R)dR/ R = 1. Wc can note that the 
case ii=constant for the definition of the objects, which 
corresponds here to /? = — oo, is not different from other 
cases in the final expression for the mass fonction. It sim- 
ply means that R is not a good variable, but one can 
still calculate the mass fraction dM. = xh{x)dx and the 
multiplicity function dM = po/M xh{x)dx, which can be 
expressed in terms of the mass of the objects M. Note 
that we can always use the variable x to count objects, 
since /? < 7. 

Now we look at the case /3 > 7. Since by construc- 
tion we have dR > in (62), and dx/dR < 0, we get a 
mass fraction which is negative if we apply directly (62). 
This is due to the fact that what wc have to consider 
now is no longer Fm(> A{R),R) but F„(< A{R),R). 
This is obvious if the density profile of typical halos (or 
more exactly of the cells) is p oc R~^. Indeed, a halo has 
a radius between R and R + dR if the curve (1 -|- A)(r) 
intersects the actual halo profile (1 -I- S){r) at a scale r 
between R and R + dR. If the halo profile is p oc , 
and f3 > ^, the curve (1 + A)(r) is steeper than the 
halo density profile and the intersection only occurs be- 
tween i? and i? + dR if (1 + d){R) < {1 + A)(i?) and 
{l + S){R + dR) > {l + A)iR + dR). Hence, if ahalohasa 
size larger than i? it satisfies (1 +(5) (i?) < (1 + A)(i?). Note 
that the curve (1 + A)(i?) which defines the boundary of 
the dark matter halos is not the physical constraint which 
applies all along a given halo. For instance, in the case of 
galaxies (see Valageas & Schaeffer 1997), the cooling con- 
straint translates into a cooling radius Rcooi{T) defined 
by a temperature - density relation Pcooi{T). That is, if 
we consider to simplify that halos are isothermal (7 = 2), 
for a given halo defined by its virial temperature T all 
the matter inside the radius Rcooi{T) where the density 
is larger than Pcooi{T) will be considered as part of the 
galaxy (because the cooling time in these regions is small). 
If we express the relation Pcooi{T) as a relation (1 + A)(i?), 
we obtain a curve which is very steep: /3 > 3, and as a 
consequence in the inner regions of the halo R < Rcool we 
have (1 -I- 5){R) < (1 -I- A)(i?) while in the outer regions 
R > Rcool we get (1 -t- d){R) > (1 + A)(i?). However, con- 
trary to what the sign of these inequalities may suggest 
the density within the radius Rcool is larger than what is 
required by the cooling constraint: one has to compare the 
local density to the fixed number Pcooi{T), and not to the 



As in the previous case, /? < 7, the particuliar cases 
R =constant (/3 = -l-oo), or M =constant (/? = 3), present 
no difficulties, and one can always use a; as a variable. 

Thus, in all cases we can use for the mass fraction dM. 
and the mass function dAf : 



dM = xh{x)\dx\ and dAf = j^xh{x)\dx\ 



(66) 



Within this framework, we cannot define a mass function 

in the case /3 = 7, where x =constant. This can also be 
understood if halos have a mean profile in p (x R''^, since 
in this case it is obvious that the criterium (1 -|- A)(i?) oc 
cannot be used to define objects. However, such a 
picture may be misleading, and a more correct statement 
is simply that the mass fraction embedded in cells which 
lie above a certain density threshold A depends only on 
the parameter x. Then, if /? = 7 the derivative of the mass 
fraction is zero: dFm{> A,R)/dR = dFm{< A,R)/dR = 
0. This also suggests that the mass functions obtained in 
the case [i ~ ^ may not be very accurate, but fortunately 
the c;ases of practical interest do not cross this domain: 
/? = for usual virializcd halos and /? > 3 in the case of 
galaxies or Lyman-a clouds. 

We can note however that the previous approximation 
for the mass function of objects defined by a curve (1 -I- 
A){R) is not entirely satisfactory in the cases /? < and 
/? > 7. Indeed, contrary to the usual case A =constant, 
there is no guarantee that the density of the "objects" 
picked out in this way decreases at larger radius, even lo- 
cally at the considered scale R. One may even fear that 
such a mass function counts mainly "valleys" (cells with 
a density which increases when averaged over a larger ra- 
dius) in the domain of small x. This should not be the 
case for large x where most cells have a density profile in 
p (X R^^ (contrary to what happens for small x), as is re- 
called in App.C. This leads us to consider the immediate 
surroimdings of cells, as we did in section 2.2, which will 
allow us to specify explicitely that the density of the cells 
we count is at least a decreasing function of radius locally 
at scale R. 



5.2. Surroundings of a cell 

In a fashion similar to what we did in section 2.2, and from 
the considerations developped in the previous paragraph, 
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we can define the mass fraction dM. formed by objects of 
scale R- R + dR by: 

r/3 < 7 : dM = (1 + A)P(> A, < A') 

(67) 

[j3>^: dM = {1 + A)P(< A, > A') 

where A = A{R), A' = A{R') and R' = R + dR, with 
dR > 0. These two expressions can be recast into one 
formulation: 

dM = {l + A)P{> A, < A') with dx > (68) 

Then, for /? < 7 we have dR > 0, R' > R, while for /? > 7 
we have dR < 0, R' < R. The mass function is simply 
dAf = po/MdM. As shown in App.C, this definition leads 
to a mass function which is similar to the one encountered 
in the previous section (66): 

dM = xH{x)\dx\ a.nd dN'= ^xH{x)\dx\ (69) 

with a scaling function H{x) which differs from h{x), and 
depends on the slope —/3 of the curve (1 + A){R). As was 
the case for a constant density threshold, we can obtain 
general bounds for H{x), as compared to h{x). They show 
that in all cases both scaling functions h{x) and H{x) 
have the same exponential cutoff at large x ^ .t*, apart 
from a possible normalization constant, and that they also 
have the same power-law behaviour at small x <C x, if 
/? > 7. If /3 > 3, wc get H{x) = h{x) in the limit of 
large 2; ^ a;*. In the case |/3| — * 00, which corresponds 
to objects defined by a constant radius R, we get H{x) = 
h{x). This is natural, since in this case we obtain the usual 
counts-in-cells, at fixed radius R and the behaviour of the 
density around the object is irrelevant (in this approach). 
As was the case in the previous section, if /3 = 7 we cannot 
get a mass function within this framework. In general, 
(1 + A)(i?) is not a power-law, and the function H{x) 
should depend on the scale R through /3{R). However, in 
practice we shall simply replace H{x) by h{x) which is 
known from numerical simulations and should provide a 
fair approximation, according to the previous section and 
the relations between h{x) and H{x) described above. 

Finally, as we mentionned in the previous section, this 
formulation allows us to add explicitcly the constraint that 
the density profile is locally decreasing at the external 
radius of the objects. Thus, if /? < we can define H{x) 
such that dM = xH{x)dx by: 

l^oo <.Min(A',(5) 
xH{x)dx = (1 + A) y d5 j d6'P{6, 6') (70) 

while if /3 > 7 we have: 

/■A 

xH{x)dx = -{l + A) dS d6'P{S,5') (71) 

J A' J A' 



Hence, the mass functions we obtain still present the scal- 
ing in X (if /3 is constant), but no general bounds are avail- 
able, except in the case /? > 3, which is the one which oc- 
curs in practical cases (for galaxies or Lyman-a clouds). 
Then we have an upper bound for H{x) in both limits 
X <^ Xf and x ^ which shows that the decrease of 
x'^H{x) in these limits cannot be slower than the decline 
of x'^h{x). In the case /3 — > 00, which corresponds to ob- 
jects defined by a constant radius, and is of practical in- 
terest as it is the cooling constraint for galaxies at large 
temperature, we obtain: 

a; > : H{x) < h{x) , a; > a;* : H{x) > ^/i(a;) (72) 

Thus, as we could expect, the large x behaviour of both 
mass function is similar (apart from a possible normal- 
ization constant). This is natural since most cells have a 
density profile which decreases with radius in this regime, 
so that the additional constraint that the density is lo- 
cally decreasing has not much influence as it was already 
fulfilled in most cases. This is not so obvious for low x 
cells, where only a specific model for the density field (the 
functions h{x) or (p{y) do not contain all the needed infor- 
mation) can allow one to get a definitive answer. However, 
we showed that the mass function will in any case present 
the same scaling in x, with an upper bound if /? > 3. In 
practice, we shall replace H{x) by h(x), which is justified 
by (72) since we generally have /3 :» 3 (nearly constant 
radius), or /3 = as usual. 

6. Conclusion 

Our purpose in this paper was to examine various ana- 
lytical forms of the mass function, to understand their re- 
lation, and to get some hint to judge their accuracy. The 
main motivation for this study is to get accurate enough a 
formulation so as to be able to use these analytical forms 
to get a robust tool for studying the number evolution 
of the structures, as a preparation for the oncoming data 
from the HST and KECK telescopes. A more theoreti- 
cal motivation was to understand how the universe gets 
non-linear at galactic and extragalactic scales, and the 
relation of the presently observed structures with initial 
conditions. 

To achieve this purpose, we have compared in detail 
two approaches. The first - that we call the linear ap- 
proach - describes, along the ideas of Press and Schechter 
(1974), the clustering at the present epoch by recognizing 
in the primordial spectrum of fiuctuations those overdensi- 
ties that eventually form the structures and counting them 
there. It is in principle limited to objects with an ovcrdcn- 
sity A ~ 200 at the epoch under consideration. The second 
- called the non-linear approach - , that we develop here 
following earlier ideas (Schaeffer 1984, BS89, Bernardcau 
and Schaeffer 1991) on how the non-linear density field be- 
haves (the probability to have an overdensity (1 -|- A) in a 
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cell of scale R is given by P(A) = l/^{Rf h[{l + A)/^{R)] 
where /i is a universal function depending on initial con- 
ditions), describes the present-day clustering directly by 
means of the actual density fluctuations. This description 
holds for any overdensity A provided one is in the non- 
linear regime. 

A first comparison tackles the cloud-in- cloud problem 
raised by Bardeen et al (1986). The arguments invoqued in 
the calculation of these mass functions show that it stands 
as an approximation of a "better" calculation. The latter 
counts overdensities with a given contrast at their edge, 
defined as a future cluster in the primordial density field 
or as an actual cluster in the present density field. Such 
overdensities may actually be composite objects made of 
smaller size overdensities with the same density contrast 
surrounded by underdense regions, so that the same mass 
is counted several times, as a large object and also as sev- 
eral smaller objects. We have defined a measure of this 
overcounting by considering the probability for overden- 
sities to be surrounded by underdensities, and for under- 
densities to be surrounded by overdensities. The problem 
turns out to be severe in the linear stage where fluctu- 
ations switch easily from above to below average, and 
quite mild in the non-linear stage where the overdensities 
are well separated. We also showed that the geometrical 
"cloud-in-cloud" problem does not provide a justification 
for the usual multiplicative parameter 2 generally affected 
to the PS mass function, and is only an ad-hoc correction 
to the dynamics. 

In the linear approach, the knowledge of the r.m.s. 
density fluctuation - which are gaussian - as a function of 
scale and time in the primordial universe is sufficient. In 
the non-linear approach, the behaviour of the actual corre- 
lation as a function of scale and time is needed, but turns 
out to be sufficient to describe the density field although 
the fluctuations are violently non-gaussian (see however 
the discussion sect 4.1 and in Colombi et al, 1996, and 
Bernardeau,1992, for a more precise statement). This il- 
lustrates an important difference of the two approaches. 
In the linear approach, clustering as a function of scale 
and time goes as the linear correlation function (whence 
a factor (1 + z)^), whereas in the non- linear approach, 
clustering goes as the non-linear scaling of the correlation 
function (whence a factor (1 -|- ^r)^"'''). Both scalings can- 
not be simultaneously true, and will provide for a way to 
distinguish between the two approaches. 

This shows also that a more detailed comparison of 
the latter can be done provided one can understand the 
non-linear evolution of the correlation function. We have 
presented a model based on the spherical collapse plus 
virialization that we believe is the explanation of the non- 
linear behaviour of the correlation function, and that is 
seen to reproduce quantitatively the scaling observed by 
Hamilton et al (1988) of the non-linear correlation hmc- 
tion computed in an i7 = 1 universe as a function of the 
extrapolated linear correlation function. This scaling is 



seen not to exist for Q < 1 but our method allows to 
calculate the evolution of the correlation function in the 
latter case: f2 < 1 and A = 0. 

Specific examples of the predicted mass distribution 
and its evolution as a function of redshift are given using 
power-law initial spectra as well as CDM initial c;ondi- 
tions. The linear and non-linear mass functions show an 
intriguing qualitatively similar behaviour, as a function of 
mass as well as a hmction of redshift. Their quantitative 
differences however are serious. The error made by using 
one of the mass functions instead of the other are as large 
as the difference between the cases with different initial 
spectra. One should thus he extremely careful when de- 
riving by means of analytical approximations to the mass 
function conclusions on the adequacy or inadequacy of an 
initial spectrum from an observed distribution of objects 
(galaxies, clusters, Lja clouds). 

The undca'standing of the evolution of the non-linear 
correlation function allows one to phrase the difference 
between the linear and the non-linear approach in a way 
that clearly shows their difference, but also explains their 
apparent similarity in actual calculations. The basic as- 
sumption in the PS approach is that the seed of a cluster 
in the present-day universe may be recognized in the pri- 
mordial universe. These seeds then evolve according to the 
spherical collapse model until they enter the stable clus- 
tering regime, but the evolution conserves their number. 
Using this assumed property, together with the evolution 
of the correlation function according to the spherical col- 
lapse, which agrees with numerical simulations, we derive 
the non-linear density field implied by the PS approxima- 
tion. For a power-law primordial spectrum taken as an 
illustrative case, this non-linear density field implied by 
the PS assumption has a similar scaling as the one seen in 
numerical simulations, but differs quantitatively, the dif- 
ference being of the order of the range spanned by realistic 
changes of the initial conditions. 

Finally, we present the mass functions one can obtain 
when objects are defined by a density threshold which is 
no longer constant but may vary with the mass of the 
objects. This allows one to consider other astrophysical 
objects (like galaxies, see Valageas & Schacffcr, 1997, for 
a detailed study) than the usual just-virialized objects, 
and is of great practical importance. Since this is beyond 
the reach of usual approximations based on an extrapola- 
tion from the initial gaussian field like the PS prescription 
(which cannot model low density regions anyway), it pro- 
vides a clear illustration of the advantages of our approach. 

We relate these various multiplicity functions to the 
properties of the counts- in-cells (characterized by the scal- 
ing function h(x)). Thus, one single measure of h{x) allows 
one to obtain all these mass functions, at any time, within 
the highly non-linear regime. Moreover, we show that they 
should obey the same specific scaling in x (through a scal- 
ing function H{x) close to h{x)) as the counts-in-cells. Nu- 
merical simulations are needed to confirm this scaling and 
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precise the relation between H{x) and h{x). We can note 
that the PS approximation leads to the same scaling and 
provides a simple estimate of h{x), so that we could still 
apply in this case the method described in the non-linear 
approach. However, as we argued above this approximate 
hps{x) is probably too crude to give accurate results. 

Numerical tests have been done to check the PS ap- 
proximation. They involve for instance (Efstathiou et al 
1988) to test the scaling of the mass function as a func- 
tion of the scaling parameter v = But we have 
seen that the typical scaling of the non-linear mass func- 
tion that depends on the variable x can be rcexpressed 
as a function of the former parameter v for power-law 
initial conditions. This may be expected to be approxima- 
tively the case even for more complex initial conditions. 
The scaling test thus may not be sufficient to distinguish 
among the two approaches. The shape of the mass func- 
tion is to be explicitcly considered. The non-linear predic- 
tions have been well checked against observations (Alimi 
et al 1990, Maurogordato et al 1992, Benoist et al 1996) as 
well as against simulations (Bouchet et al 1991, Bouchet 
and Hernquist 1992, Colombi et al 1994,1995,1996,1997). 
Clearly, in the light of the previous discussion, testing the 
scaling laws of the counts- in-cclls may not be sufficient. 
But the galaxy void probability functions typically show 
( Sharp 1980, Bouchet and Lachieze-Ray 1985, Mauro- 
gordato and Lachieze-Ray 1987, Maurogordato et al 1992) 
the non-linear scaling predicted (White 1979, Schaeffer 
1984) by the non-linear theory. Also, numerical simula- 
tions arc accurate enough to give the shape of h{x) that 
is not the one that the PS approximation implies for the 
density field. We believe nevertheless that the numerical 
tests should be redone in the light of the present findings. 
Testing the PS and the non-linear predictions has been 
done by different groups. One now has two well defined 
prescriptions that can be written down using exactly the 
same choice of parameters and directly compared. Also, 
there may also be some dependence of the cluster mass 
function on the specific way in which clusters are defined. 
Clearly, would we define the number of clusters of scale R 
as the number of overdense cells of that scale minus the 
number of overdense ones at scale R + dR, the non-linear 
mass function would be exact. Our explicit consideration 
of the surroundings of a given cell, and the probability 
for the latter to be underdense, shows that one may ex- 
pect a friend-of-friends algorithm to give a similar answer. 
But this has to be examined numerically. Specific compar- 
isons with numerical results (Lacey. Valageas, Schaeffer) 
are quite promising, but will be discussed elsewhere. 

The issue is to have robust predictions of the number 
evolution of matter condensations, so as to be able to con- 
centrate on the evolution of the internal properties of the 
various objects that may be considered. The explicit con- 
sideration of the galaxy luminosity function evolution, as 
well as the evolution of X-ray clusters and Lyman-a lines 
is in preparation. The present, still phenomenological, de- 



scription of the dynamics of non-linearity may also be a 
guide to more rigorous calculations that would be done 
starting from first principles, by the use of the equations 
of motions induced by gravity. 



APPENDIX 

A. The mass function calculated from the early 
gaussian fluctuations 

A.l. Gaussian approximation 

In the case of a gaussian distribution, the probability for 
a cell of radius R to have a density contrast 5 is: 



P{5, R)d6 



27rS(ii) 



(Al) 



The probability distribution P{d, R;S' , R'), or simply 
P{S, S'), to have a contrast S in the radius R and a contrast 
S' in the larger radius R' is 



P{S,S') = 



with 



1 



27rVS2E'2 - S"4 
X exp 



'V2^2 1 



2E2 2S2(S2E'2 - E"4) 



(A2) 



e2 = y d^k W{kRfP{k) , E' = E(i?') 
E"2 = J d^k W{kR)W{kR')P{k) 

where W{x) is the window function (for a top-hat filter in 
real space W{x) = 3(sin.T — xcosx)/x^) and P{k) is the 
power-spectrum of the initial density fluctuations. If the 
window function is a top-hat in R, or a gaussian, we have 
when R' = R + dR: 



S2s'2 - E"* = ( aE 



//4 



,dM 



V 

M J 



(A3) 



where a depends on the slope of P{k) at scale R (it is a 
constant if P{k) is a power-law) and the distribution in 
5', at fixed 6, is a gaussian centered on and width ao 
with: 



So = S 1 



dlnH dM 
dlnM^ 



and ao = '^^^^ i-^^) 



The case of a top-hat in k is very peculiar, since E" = E' 
and we obtain: 



5o = S 1 + 2 



dlnY. dM 



dluM M 
which is fundamentally different 



and (To = E^/2 „ _ ,^ ^ri-(A5) 



dlnM M 
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A. 2. The multiplicity function: Press- Schechter formula- 
tion 

Much has been said about this way to calculate the num- 
ber of objects with mass M (Press and Schechter 1974 , 
Schaeffer and Silk 1985,1988, see White 1993 for a review). 
We recall here a few relevant features of this approach for 
the sake of comparison with our new derivation of the 
mass function. 

The idea, here, in order to avoid entering into the de- 
scription of the universe at the non-linear scales, is to 
try to recognize in the early universe those very small 
density fluctuations that are eventually going to become 
clusters or voids, to count them and to attach to them 
non-linear properties deduced from their properties in the 
linear stage. The mass of these objects is given as an ini- 
tial condition. Their radius or density contrast then is as- 
sumed to be the result of their internal evolution. The 
main difficulty is to be granted that these proto-objects 
remain isolated during and after their formation, so that 
their number or their mass do not evolve in the non-linear 
stage. 

An overdense cell containing the mass M initially fol- 
lowing the general expansion, whose overdensity grows ac- 
cording to the linear theory, slows down and finally col- 
lapses. One often labels the overdensity by the value do it 
would have presently would his evolution be given all the 
way by linear theory 



6L{t) = SoD{t)/D{to) 



(A6) 



The statistics of 6l in the very early universe, where it is 
simply equal to (5, is thus the statistics of at the present 
epoch, using linear theory 



P{d)d6 



1 



27rSc 



(A7) 



The mass of the object remains constant, equal to M , and 
Rm, see (B2), which plays the role of a mass scale, is the 
radius the object would have at the present epoch would 
it simply follow the general expansion. The actual radius 
R has a maximum Rmax at a time t = tmaxi^o)- It then 
starts to shrink. With no kinetic energy at the maximum 
radius, energy conservation and virial equilibrium imply 
that the collapsed object has reached a stationary state 
with a radius Rvir = Rmaxf^ at a time t = tvir{5o), usu- 
ally chosen as the time of collapse given by the spherical 
model, that is when 6 = Sc with dc = 1.69 for f2 = 1 and 
6c = 1.62 today for = 0.1. The overdensity at the time 
of virialization is thus Ac = 178 for = 1 and Ac = 978 
today for = 0.1. All objects having reached this stage 
are supposed to give rise to a massive collapsed halo. The 
mass fraction within such objects of mass larger than M 
is then given by the requirement that 6 in the very early 
universe, where it is negligibly small, is such that, at time 



t, it will be above the required threshold 5c 
M. Therefore 



at the scale 



F,{> 5c) = F^{> 5c) = 



with 



d5 



-<S^/(2E^(M)) 



27rS(M) 



(A8) 



E(M) - So(M) D{t)/D{to) 

the mass and volume fractions relative to the considered 
cells being the same. The operation can be repeated for 
larger cells. Only the mass fraction above threshold at 
scale M, and not at scale M -\- dM, that is 

Fm{> 5c, M) - Fm{> 5c, M + dM) 

= -^F^{>5c,M)dM 

produces objects of mass between M and M -t- dM. It is 
then possible to recognize in the very early universe those 
features that will, at a given much later epoch t, have gone 
above the given threshold. This leads to the mass function: 



r]{M) 



dM 
M 



1 Pa 5c 
2^ MY. 



dlnS 



dlnAf 



-5^/(2E2 



dM 



(A9) 



a formula in principle valid for any mass M. 

The advantage of this approach is its extreme simplic- 
ity, if one does not consider the cloud-in-cloud problem, 
together with a direct connection with the primordial fluc- 
tuation spectrum. A rather serious drawback is that the 
statistics is done in the very early universe, trying to recog- 
nize protoclusters among small overdensities while it is not 
clear that there is no evolution of the number of objects 
after non-linearity has occurred. An obvious consequence 
of (A9) is that the total mass in the above halos 



Jo 



dv 



2tt 



2 



(AlO) 



represents only half of the matter. From our experience 
in modelling the present, obviously non-linear universe, 
this is not true: most of the mass must be within the con- 
densed objects. For this reason alone, the PS approach 
cannot be an exact theory as has been claimed at various 
places. This problem can be cured using the excursion sets 
formalism (Cole 1989, Bond et al. 1991) for the very spe- 
cific case of a top-hat window function in k (which should 
not be confused with the usual top-hat in i?, that we use 
thougout this paper), which gives the PS mass function 
multiplied by the factor 2. However, as we argue in chap- 
ter 2 this cannot be extended to other window functions 
(the fact that a top-hat in /c is a very peculiar case was 
also apparant in the previous paragraph). Also, despite 
the increased complexity, one is still working within the 
linear universe (with no control of any evolution once in 
the non- linear stage). For similar reasons (cloud-in-cloud 
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problem, and, although to a lesser extent, dependence on 
the window function, complexity even in the linear approx- 
imation), the attempt by Bardeen et al. (1986) to define 
the mass function by counting the peaks of the density 
distribution in the linear stage is not entirely satisfactory, 
as already stated by the authors. 

A. 3. Surroundings of a cell 

An alternative formulation starts from the remark that, in 
the prescription described above, one substracts the frac- 
tion of matter potentially non-linear at scale M + dM from 
the fraction at scale M . This provides no guarantee that 
the immediate surroundings of a cell above threshold are 
below that same threshold, as one could expect for a true 
proto-cluster. It is, on the other hand, possible to calcu- 
late the probability for this to occur. Divide the universe 
into cells of size Rm and consider, for each of these cells 
a cell centered at the same place but of size Rm + dRM ■ 
The probability P{> 6,-, Rm - < 5c, Rm + dRM), which we 
shall note simply P(> dc,< Sc), for having an overdensity 
6 above a given threshold 5c in the original cell and below 
6c in the larger cell is (using (A2)), still in the "linear" 
gaussian regime. 



P{> 5c, < 5c) 



)_5c 



dlnE 



-5,V(2S') 



dlnM 



1-^ 



dM 



du 



2w 



(All) 



with = 5c/Tj{M) and f3{M) is a parameter which de- 
pends on the form of the initial power-spectrum P{k). 
If P{k) is a power-law /3 is a constant. Note that for 
a top-hat in k, one cannot define a mass function from 
P{> 5c, R; < 5c, R + dR) as this latter quantity behaves 
as y/dR/R for small dR/R. The formula (All) looks very 
similar to (A9) , and indeed this prescription is closely re- 
lated to that previous formulation: 



P{>5c,<5c)= Fmi>5c,M)- 
+P{< 5c, > 5c 



F„(> 5c,M + dM) 



(A12) 



It is indeed staightforward to see that when substracting 
Fmi> 5c, M + dM) from Fm{> 5c, M), one substracts a 
little too much, the case where the density is low inside but 
high in the larger cell being included in Fm{> 5c, M+dM). 

If the evaluation of the mass function by substraction 
of the mass fractions F^ at different scales has any mean- 
ing, it should not be too different from P(> 5c, < 5c), the 
difference between the two measuring the error due to the 
fact that some of the protoclusters, that is density patches 
in the linear regime with a given density threshold, may 
contain smaller patches with the same density thcshold 
that are then also counted as protoclusters at a smaller 
mass. Similarly to Chapter 2.2., the use of P{> 5c, < 5c) 







loe[it(t')l 


0.4 
























-1 






1.6 




log(i^) \ 




-0.8 -0.4 


0.4 



Fig. 10. Behaviour of the mass fraction at the present epoch 
fi{M)dM /M = ^{v)dv/v , obtained by counting the overdensi- 
tics in the early universe where the fluctuations arc sufficiently 
small to be described by linear theory, with the assumption 
that their number does not evolve once they become non-linear. 
The graph shows the mass fraction per unit logarithmic inter- 
val of V : jj,x{v) and fj,ps{i>), in the case n = —1.5 , P = 0.43 



to define a mass function leads to the mass fraction in 
objects of mass M : 



M><(M) 



dM 



1 



V e 



dv 



27r 



\ 

J — OO 



1-^ 



du 



27r 



while the PS prescription gives 



HPs{M) 



dM 



2-K 



V e 



-1/2/2 



dv 



(A13) 



(A14) 



Here, iJL{M)dM / M is the mass fraction in objects of mass 
between M and M + dM, and r?(M) = po/M n{M). 
These two mass functions are similar for large masses 
(that is ^ 1), as can be checked on Fig. 10, but at small 
masses /Lt><(z/)/z/ diverges as 1/;^ while /ips(j^)/zv remains 
finite. As a consequence, ^j,y<(M)dM / M = +oo while 
HPs{M)dM/M = 1/2 . This is due to the fact that 
when S ^ oo, (M 0) the density around a given point 
shows wide oscillations (^ E) and crosses the threshold 5c 
many times, which leads to a large over-counting. This is 
the well-known cloud-in-cloud problem raised by Bardeen 
et al.(1986), that renders this approach uncertain. 

B. Counts in cells and matter distribution 

A convenient way to describe the density field above the 
galactic scales is to divide the universe into cells of a given 
size R and to consider the distribution of matter within 
these cells. It is then of interest to define the probability 
distribution of the density p within each cell or equiva- 
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lently of the deviation 6 oi p from the average density po 
of matter in the Universe 

S= {p- Po)/Po 

and to study it as a function of the cell size i?, that plays 
the role of a smoothing scale. In the following, wc shall, 
as a rule, omit to precise cxplicitcly the radius of the cells 
wc consider, for instance we shall note P{6, R) simply as 
P{S), except when we consider different lengths and there 
is a risk of confusion. 

B.l. Gaussian limit 

In the regime where 6 and ^ are small, say S , ^ 1, 
that is at very large scales, this probability takes the well- 
known gaussian form 



P{6)d5 



1 



(Bl) 



and is determined once £,{R) is known. The latter is the 
average of the two-body correlation function ^2{ri,r2) 
within the volume of the cell 



d^ri d^r2 
y2 



6(n,r-2) with V = -ttR^ 



For a gaussian distribution it is usually denoted in the 
litterature. We use the notation ^ (R) in use for the actual 
mass fluctuation at scale R in the non-linear regime. Wc 
keep the notation for the r.m.s. mass fluctuation 

obtained in the linear approximation, calculated at scale 
M in the very early universe (where it is assumed to be 
infinitely small) and extrapolated to any epoch with the 
proper linear growth factor. We will also use for the lin- 
ear correlation function the notation £,l{Rm) = S^(M) , 
where Rm is a comoving length that plays the role of a 
mass scale, defined by 



M = 47r/3 po Rli 



B.2. Deviations from gaussian behaviour 



(B2) 



Deviations from the gaussian behaviour, however, occur 
as soon as 5 is not very small. They can be described 
as follows (White 1979, Schaeffer 1984, 1985, Hamilton 
1988b, BS89, Bernardeau and Schaeffer 1992, Bernardeau 
1992). One introduces the auxiliary function 



=c (_\n-l 



71=1 



n! 



(B3) 



where the coefficients Sn , which are all positive, describe, 
for n > 2, the deviations from the gaussian behaviour. 
These coefficients are related to the many-body correla- 
tion functions ^„(ri, rn) by 



5„ = ^ with ?„= / 



^„(ri,...,r„) 



They depend in principle on the scale R and on the shape 
of the considered cells. The function ip{y) was introduced 
by the above authors in the case where the many-body 
correlation functions exhibit a scaling behaviour, precisely 
such as to make 5„ scale independent and argue that, to 
a rather good approximation, these parameters are also 
shape independent, the latter being not compulsory for 
the purpose we aim to in this paper. The probability dis- 
tribution of the density within the cell is then given by 



P{5) 



-\-ioo 



dy 



—= exp 



o 27ri^ 
which can be inverted as 



{l + 6)y-^{y) 



P{6)d6 



(B4) 



(B5) 



In the limit where (p{y) = y — y'^/2, that is when all co- 
efficients Sn for n > 2 vanish, the gaussian behaviour is 
recovered. Schaefi^er (1985) by comparison with data, and 
BS89 for very general reasons due to the behaviour of ip{y) 
however argue that Sn should grow at least as n! and show 
that, as soon as 6 is not much smaller than ^, the values 
of y contributing to the integral are of order unity and 
the deviations from a gaussian behaviour are important. 
For S much larger than ^, large tails develop, the gaussian 
behaviour being replaced by a slow exponential decrease. 

In the limit ^ <C 1 , but for any S, i.e. in what he called 
the quasi-linear regime, expected to hold at large scales 
or early times, Bernardeau (1992, 1994) has been able to 
calculate explicitely the function ip{y), that arises purely 
from the gravitational instability, and its relation with the 
primordial spectrum of fluctuations. 

In the limit ^ » 1, that is relevant at small scales and 
late times, under the sole assumption of scale-invariance 
of the many-body correlation functions expected (Peebles 
1980) in the deeply non-linear regime, BS89 give the prop- 
erties of the function (p{y) independently of any model and 
whence of P{S). They discuss possible analytical models 
for (p{y). Further models closer to what is required by the 
data are given in Bernardeau and Schaeffer (1992). The 
scalc-invariance of the correlations implies that the coef- 
ficients Sn depend only on n, being independent of the 
cell size and time. Although the qualitative properties of 
these coefficients, thus, are similar in the non-linear and in 
the quasi-linear regimes, their values are expected, from 
theoretical reasons, to be different. Numerical simulations 
(Colombi et al 1996) indeed show they are constant in 
both regimes ^ <C 1 and ^ 1, with a strong transition 
at scales/times for which ^ increases from ~ 1 to ~ 100, 
in agreement with the spherical collapse model originally 
derived only for the two-body correlation function (Gott 
and Rees 1975, Peebles 1980). Astonishingly, the func- 
tions ip{y) mentioned above, that reproduce the data in 
the deeply non-linear regime and the ones calculated ex- 
actly in the quasi-linear regime do not look quantitatively 
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very different. Indeed, their properties originate from gen- 
eral model-independent considerations such as the special 
scaling due to the fact the coefficients S'„ depend only on 
n, their rapid growth with n, the positivity of the proba- 
bilities and of the reduced moments < 5"^^ >. 

B.3. Properties of the density distribution as a function of 

smoothing scale 

We summarize here the properties of the probability dis- 
tribution of the density within a cell, as derived by BS89 
in both the ^ 3> 1 and the ^ ^ 1 regimes. These au- 
thors showed these properties describe the large-scale mat- 
ter distribution in the Universe in the deeply non-linear 
regime, whereas Bernardcau (1992, 1994) showed they arc 
also relevant - with in principle different parameters - to 
the quasi-linear regime. 

B.3.1. Model-independant properties of (fi{y) 
The function ip{y) behaves as 



f y^O : ip{y)=y-y^/2 + ... 

[y — > oo : ip{y) a y^~'^ with < a; < 1 



(B6) 



The rapid growth of 5„ as a function of n implies a sin- 
gular behaviour of ip{y) at small negative values of y, say 
ys = — l/x* with being large (.t* ^ 10): 



y^yt- <P(y) = -asT{uJs) [y - ysY 



(B7) 



where we neglected less singular terms. Moreover, ip{y) 
and all its derivatives (— )" ip^^\y) must be non-negative 
for all values of y > 0. 

B.3.2. Large scales / early times 

For ^ <C 1 the probability P{5) can be evaluated using the 
saddle-point method. Defining 

V'(^) = ^(y) - (1 + 5)y 

with y given as a function y{6) of 5 by 

^'{y) = l + 5 

we get 



P{5) 



liP"{6)_ ^_ 



27rC 



(B8) 



For \6\ <C 1 only, (p{y) can be expanded around y = to 
yield y(5) = —6 and tp{6) = , which is the gaussian 
behaviour, more precisely valid when \S\ ^ l/x* so as to 
have the series giving 'p{y) converge, indeed an extremely 
small value (of the order of 1/a;* ~ 0.1). 



B.3.3. Small scales / late times 

For ^ :» 1, the many-body correlations totally dominate 
the density distribution. The probability P{d) takes two 
different forms. 

a) For values of {I + S) that are not too small, f{y)/$, 
in the exponent of (B4) can be expanded, to give as the 
first non- vanishing contribution 



o 27ri^ 



(B9) 



In this regime, P{S) exhibits a scaling law, in the sense 
that ^ P{d) is a function of the unique parameter x = 

a + s)/i : 



with 

h{x) 



J -ICO J-ioo 



dy 'p'jy) 

2iTi X 



(BIO) 



e^^(Bll) 



The behaviour of ip{y) near y = implies the sum rule 

POO 

/ xh{x) dx = 1 (B12) 
Jo 

which states that all the matter is within cells of density 
contrast satisfying (BIO). In fact we have: 



n > 1 



: Sn= I 
Jo 



x'^hix) dx , Si = S2 = l (B13) 



At small values of [1+5) that still obey the condition 
(BIO), P{5) is a power-law given by the behaviour of </?(y) 
at large y 



a(l — uj) 



1 + 8 



(B14) 



and for large (1 + (5)/^, the above saddle-point evaluation 
holds, to show that the singular behaviour of (p(y) at y = 
— 1/a;* imposes 





■ 1 + ^" 











l + 5>^ : P(<5)^ =1 



In the limit ^ ^ oo we get: 

/oo _ 
^ (1 - e-(i+*)^/«) P{S)d6 



(B15) 



and 



/OO _ 
e-(i+5WC {i + 5)P{6)d5 (B16) 
-1 
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b) For values of (l+S) that are not too large, only large 
values of y contribute to the integral defining P{S) so that 
(p{y) can be replaced by its asymptotic form a y^~'^ to 
yield 



P{6) 



dy 



= exp 



{l + d)y- ay 



l-Ul 



(B17) 



that implies another scaling of P{6) as a function of the 
variable ^ = (1 + (5)a-V(i-)f "/^'""^ 



1 + S«l : P{S)=a-'/('-^r^^'"^^ g{z) 



with 



/+200 
-ioo 



dt 



- exp{zt — t 



1-0J\ 



2m 



At small values of z, P{5) takes the form 



(B18) 
(B19) 



27rw2;(i+'^)/'' 



exp 



1 - w 



if i+^«r"^^'""^ 



(B20) 



which vanishes exponentially when (1 + (5) and whence z 
goes to zero. For large values of z, still under the condition 
(B18), P{5) is the power-law 



P[5)^a-''^'--^r'^'-''^^^z--^ 

1 T — w/(l— oi) ^ c- 

when ^ + 



(B21) 



This expression is identical to (B14) since in the region 

^ ^ ^ ^l + (5^^ the conditions of validity of both 
expressions (B14) and (B21) overlap. 

The /(,(.'/;) and g(z) scalings have been seen to hold in 
the deeply non-linear regime in galaxy surveys the same 
scaling is expected for the galaxy distribution as advo- 
cated by BS89 but with a different function hgai{x) (since 
the distribution of galaxies is biased with respect to the 
underlying matter distribution) as shown by Bernardeau 
and Schaeffer 1992 - for the CfA (Alimi et al 1990) as 
well as the SSRS (Maurogordato et al 1992, Benoist et al 
1996). For galaxies one has to measured to be ~ 0.6 to 1, 
a ~ 1, and a poorly determined value of 1/x,, that is def- 
initely smaller than 0.3 . Much more precise tests can be 
done using numerical simulations of the matter distribu- 
tion (Bouchet et al. 1991, Colombi et al. 1992,1994,1995) 
for various initial conditions. The density probability dis- 
tribution is found to obey extremely well the above scal- 
ing, with functions h{x) that depend on the initial con- 
ditions, with values of uj around 0.5 and a well defined 
exponential decrease with l/x* measured to be between 
0.07 and 0.2 . In the quasi-linear regime, too, the theoret- 
ical predictions are practically indistinguishable from the 
results of the simulations at the times and scales corre- 
sponding to this regime (Bernardeau 1994). 



3.4. Picture of the density fluctuations implied by hierar- 
chical clustering 

In the non-linear regime, the density distribution is thus 

a power-law of the density p = {l + S)pQ with an exponen- 
tial falloff at p > pc — poS, a,t the high densitiy end, and 

-I //-I _ \ — —uj/{l—(jj) 

another cut ai p < p^ — po a ' , as we can 

see on Fig.ll. For a smaller scale R, ^ gets extremely large, 
being infinite as i? ^ 0. So the range where P(S) behaves 
as a power-law has a wider and wider extention. There is 
a non- vanishing probability to have arbitrary large densi- 
ties, which means that sampling with small cells provides 
for rare peaks whose hight increases without limit when 
R goes to zero. The probability to find cells of arbitrary 
low density gets arbitrarily large in the same limit: there 
are numerous very underdense "holes" in the density dis- 
tribution. 
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Fig. 11. Probability P{p) for finding a density p in a cell within 
which the average correlation function is ^ = 200 in the case 
ui = 1/2, a»s = —3/2 . The density distribution is a power-law 
with a cutoff at small (pv) and large (pc) densities. Both p„ and 
Pc are scale-dependent: for large ^, pv goes to while pc goes 
to infinity. The curve for p ^ pv corresponds to the part of 
P{p) which scales as h{x), with a power-law decrease followed 
by an exponential cut and the curve for p <^ pc corresponds to 
the part of P{p) which scales as g{z). Note that both regions 
overlap. The dashed line on the left is the approximation (B14), 
P{p) oc p"~^, and the dashed line on the right is the power-law 
P{p) oc p^'~^ corresponding to (B15) without the exponential 
falloff. 



The fraction of volume occupied by cells of density 
larger than p = [1 -\- A)/9o is 



f-OO 

F,{> A) = / P{6) 

J A 



d6'. 



(B22) 



for 1 -I- A » a^/(^"'^)$ '^^'^ The latter limit vanishes 
for small R: even for moderately small positive values of 
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the density contrast A, Fy{> A) is given by the above 
expression, and is small for large ^: the volume occupied 
by the overdense cells is vanishingly small 

The fraction of matter within cells of overdensity larger 
than A is, for 1 + A < f , 



Fmi> A) 



~ 1 - 



(1 + S)P{S) d6 
a(l-w) /1 + A 



r(a; + 1) 



(B23) 



In the non-linear regime where ^ is large, even for moder- 
ately large A, all the mass is contained in overdense cells. 

At the larger scales, or the earlier times when ^ gets 
smaller, the power-law region in-between the two limits 
py and pc where the probability falls down to zero shrinks 
and eventually disappears for ^ <^ 1, but whatever small 
^, the gaussian behaviour appears only for densities very 
close to the average 
volume in overdense {6 > A) cells is 



< 0.1). For |A| -C C^^^ -C 1, the 



F,(> A) = P{d)d5c^l 



2 ./2< 

while the fraction of mass is, since S is small 

/>00 f-OO 

Fm{> A) = / (1 + 5)P{8) d6 ~ / P{S) dS 

J A J A 

_ 1 _ A 

~ 2 ^ 



(B24) 



(B25) 



Thus, these overdense cells occupy half of the volume and 
contain half of the mass. This is genuinely due to the 
smallness of the density fluctuations in this regime. 

C. Non-linear models: surroundings of a cell 
C.l. General results 

As we noticed in the main text, an alternative defini- 
tion for the mass function that takes the surroundings of 
an overdensity explicitely into account is r]{M)dM/M = 
Po/Mfm{> A,< A;R)dR where /,„(> A, < A; R)dR = 
Fm{> A,R;< A, R+dR) is the mass fraction in cells with 
an overdensity larger than A in i? but smaller than A in 
R + dR. The mass fraction Fm{> A, R; < A, R') in cells 
with overdensity larger than A in i? but smaller than A 
in R' verifies Fm{> A, R; < A, R') = (1 + A)P(> A, R; < 
A, R') so that 



r;(M) 



dM 
IT 



Po/M (1 + A)P(> A, R; < A, R') 



(CI) 



In the same way we introduce a second definition for the 
multiplicity function of underdense regions r}u{R) dR/R 
that can be formulated as: 



dR 



P{< -A,R-> -A,R') 



(C2) 



There is an important relation among these probabilities 
that may be written 



P(> A, R; < A, R') = P{> A, R) - P{> A, R') 
+P{< A,R->A,R') 



(C3) 



or 



P(< - A, R- > -A, R') = P{< -A, R) - P{< - A, R') 

+P{> -A,R;<-A,R') 

both being valid for either sign of A . The difference in the 
r.h.s is our standard definition of the mass function. The 
alternative definition of the mass function, the standard 
one and the distribution of underdense regions are thus 
related 

C.1.1. Distribution within two volumes 

We are thus led to consider two volumes: a central 
sphere of radius R, volume V, and a sphere of radius R', 
volume^', or the corona of volume V" = V — V. We 

need the probability distribution P{S, R: 6' , R') for a den- 
sity contrast 5 over R and 6' over R' . As one does for a 
simple cell, we define the generating function: 



V{\,\")= X^X"^"P{N,N") 



(C4) 



JV,JV" 



where P{N, N") is the probability that there are N par- 
ticles in the sphere V and N" particles in the corona V" . 
As is well-known, the generating function is related to the 
many-body correlation function through: 

P(A,A")=exp[x(A,A")] (C5) 
with 

x(A,A") = f:^^ f l[d'riUf'{ri)Uru...,rn) 

n=l V ^ „ 

and 



n n 



, , /(I - A) over V 
^ A") oyctV" 

Then, as is usual for simple cells, we define (BS89) 

- f -f-r f TT d^r'' ^ , n n ^ 

^k,n-k= [[-rr [[-r^ ^n{n,..,rk,ri,..,r^_,^) 



n—k 



k,n — k 



^ , N, = poVi , e = V"/V 



2/ = (1 - \)N, , y" = e(l - A")A^c , x(A, A") = -i>{y,y")/i 
Hence: 

I \ ^ / \ o ..fe n—k 



n=l ■ fe=0 ^ ^ 



(C6) 
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case ), gives: 



and the continuous limit in N,N" ( denoted 1^,1/" in this with Re{y") > along the integration path, which re- 
places the function h{x) we had for simple cells: Hence to 
get the behaviour of the new mass function as compared 
to the one we got from simple cells we only need to com- 
pare h{x) and H{x). It is also convenient to write H{x) 
as: 



/-\-iOQ 
-ZOO ^ 



dydy" 



exp 



The function ^p{y,y") depends on e through the coeffi- 
cients Sk,n-k- It is the analog of the function (p{y) we 
used for simple cells. Using the fact that Xv,V"(A, A) = 
Xv+v"W, we have: 



^{y,y) = {l + er^^ ^ y{l + e) 



Nl-7/3 



(C7) 



H{x) 
with 

Q{y)-- 



-\-ioo 



dy Q{y) ^ 
2m X 



C.1.2. Limit of an infinitesimally thin corona 

Since Sn-1,1 = [1 — 7/3 + 7/(3n)]S'„ to zeroth order in e, 
the expansion of y") to the first order in e or y" is: 



+ioo J.,// 



dy" i,{y-y",y") 



2m 



,111 



(CIS) 



, Re(y") > (C14) 



xl){y,y")^^{y)+y" 



... (C8) 



that can be usefully compared to the same expression 
(Bll) for h{x) in which ^p'{y) replaces Q{y). 

C.1.3. Properties of H{x) 



It is also possible to relate the normalization of the mass 
If we now consider the density contrasts 5 over V and 5' function to the function Q{y). Indeed, if we note that 

r]{M)dM/M = po/M n{M)dM/M where i^{M)dM/M 
is the mass fraction in objects of mass between M and 
M + dM, we have: 



over V we get: 

'■+*°° dydy" 



e(27riC)' 



+ 



X exp 



1 



^ {l + e){l + SJ-{l + 6) ^„ _ i^{y, y") 



Then a simple integration gives to first order in e : 



P(> A, < A) = e 



-^rioG 



dydy" 1 Ki+AK^+V'K^T/g 



(27ri)2 y"2 

with Re {y") > along the integration path, and 

- (_)n-l " 



(C9) 



i>{y,y") = Y. 



E 



^fc,n-fe yS""-' (CIO) 



n=l fe=0 

where the function ip{y, y") is independent of e and: 

i>{y,y") = iimV'(y,2/") 



Sk,n-k = lim Sk,n-k 



Since r]{M)dM/M = po/M F„(> A, R; < A, R') , we 
get writing dM = cM: 



r°° dM r°° 

J, ^^^^W^J, ^H{x)dx = Q{0) (C15) 

Hence (5(0) should be equal to unity if all the mass is 
distributed within the objects defined this way, and if the 
same overdensity has not been counted several times at 
different scales (as can be feared due to the cloud-in-cloud 
problem) 

It is readily seen from (C3) that H{x) is related to 

h{x) by 

xH{x) dx = — ^x J h{x)doc^ dx 

+ (1 + A)P(< A,i?;> A,i?') 
We have from the expression of P(z^, v"), in the limit 



: 



niM) 



dydy" 1 + ^ ui+A)(vW)-4:(y,v")}/£ 



/ + 200 
-ioo 



dydy" 



{{l+S)y+(.l+S")y"-i>{y,y")}/i 



Using the same approximations as for simple cells we have 
in the limit of large ^: 



(27ri02 

The corresponding Laplace transform is: 



(C16) 



and 
H{x) 



dM 



^xHix)dx with.= i^ 



1 



(Cll) 



/OO _ 
g-m+S)y+a+5")y"}/S S")dddS" {C17) 



while for simple cells, we have(B5). We can note that these 
relations imply: 



dydy" i'iy,y") x{y+v") 

{2mf xy"^ 



fRe(y) > : 



< 1 , Re[^{y)] > 



^^^^^ iRe(y) > , Re{y") > : Re[^(y, y")] > 
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For double cells we get: 

/oo _ 

If we report this expression into (C14) we obtain: 



Q{y) = lim - 



X dd" [(1 + ^) - (1 + 6")]P{5, S") (C18) 
Moreover, we get from the definition of h{x) and H{x): 



( xh{x) = Jim ^ (1 + 5)P{5) 



xH{x) 



Jim i I 



[{l + 5)-{l + 5")]P{5,5")d5" 



with a; = (1 + 5)/S^. Hence wc obtain: 



a; > : - 

7 



< 1 + .5" > 



/<5 



1 + .5 



Hix) 3 
h(x) 7 



(C19) 



where < 1+S" > /g is the mean value of {1+5") when there 
is a fixed density contrast 6 in the internal sphere V. When 
a; » x*, one can show that < 1 + 6" >/s= (1 — 7/3)(l + i5), 
which corresponds to a fractal dimension 8 — 7, so that: 



a; a;* : h{x) < H{x) < —h{x) 



(C20) 



Finally, the normalization of the mass function is given 
by: 

g(0) = ^J dS j d5"{{l + <5) - (1 + 5")]P{5, <5")(C21) 
Thus we obtain: 

Vx : < H{x) < -h{x) and < Q(0) < - (C22) 
7 7 

The mass function defined by means of P(> A, R; < 
A,R') due to possible density inversions measured by 
P{< A, R; > A, R') counts the same overdensities sev- 
eral times. This illustrates the overcounting inherent in 
a density threshold algorithm, due to the cloud-in-cloud 
problem. The bound (C22) however shows this problem is 
mild when the counts are done in the non-linear regime. 
The same counts of overdensities in the early universe lead 
to a dramatic overcounting since each overdensities are 
counted infinitely many times at different scales (App.A) 
in the limit ^ ^ 00. We can note on Fig. 11 and eq.(B14) 
that integrals of the form /(I + S)P{5)dS are dominated 
by values of .t ^ or (1 + 5) ^ .x,^. Hence, we can expect 
that (5(0) ~ 1 in the general case. Thus, both mass func- 
tions, expressed with h{x) and H{x), are probably quite 



similar. We shall confirm this in the next section in the 
case of a specific model. 

We also obtain the general relations: 



P(>A,<A) _ 



P(<A,>A) 



= J [{l + A)-{l + S")]P{A,S")dd" 

/OO 
[{l + S")-il + A)]P{A,6")dd" 
A 



(C23) 



in the limit e 0. We shall see in the next section 
in some particular cases that for large densities P(> 
A, < A) 3> P{< A, > A) while for very small densities 
P(> A,< A) < P(< A,> A). 



C.1.4. Underdense regions 

In a fashion similar to the calculation of P(> A, < A) one 
gets: 



P(< -A,> -A) 



+^°° dydy" € 
(27ri)2y"2' 



.{(l-A){y+y")-4,}/i 



with Re {y") < along the integration path. We 
shall see in the next section, with a particular model for 
tl>{y,y"), that this leads to 



VuiR) ^ = - ^ G{z) dz 



(C24) 



where G{z) replaces the function g{z) we had for simple 
cells, and is defined by: 

P(< -A, > -A) = -G{z) dz (C25) 

and z = (1 — A) a-i/(i-'^) ^'^^'^ ig the variable we 
already encountered in the case of simple cells. Using (14) 

we can see that 



-G{z)dz = -g{z)dz + P(> -A, < -A) 
which implies that 
> : G{z) > g{z) 



(C26) 



(C27) 



We can note that this also means that P(> —A, < —A) < 
P(< — A, > —A). This is consistent with the fact that, 
when a; <C a;*, hence when the scaling in g{z) is valid, we 
have < 1 + 5" >/s= (1 + 7/3 lu/{1 - w))(l + 5), which 
corresponds to a fractal dimension 3 + 7a;/(l — w) larger 
than 3. Prom (C26) and (C23) we also see that 



G{z) = g{z) - ^ J_^^[{1 - A) - (1 + 5")]P{-A, 5' 



)d5" 



which, since e 



3(l-t^) dz 



, implies 



G{z) < g{z) + _ A)P(- A, 5")d5" 

1^ z J-l 
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that is 



G{z)< 



1 + 



3(1 -w) 



9{z) 



Whence, collecting all inequalities we obtain 



9{z) < G{z) < 



1 + 



3(1 -w) 



7C1J 



(C28) 



that implies very strong constraints on the asymptotic 
forms of G{z) which are the same as those of g{z), ex- 
cept for a normalization constant, for both z ^ and for 
z ^ 00. In the latter case, writing G{z) = k g{z) oc z'^~'^, 
we then have fc > 1. When 2 » 1 and x <C 1 we are in 
the region where both scalings in z and in x hold: H{x) 
thus behaves as a;'^"^ as does h{x), except in the special 
case where fc = 1 and where H{x) then decreases faster, as 
^1^-1 ^Yic generic case. This completes the proof that in 
general H{x) and h{x) have the same asymptotic forms. 

In a more specific way, one can show that if the singu- 
larity of '!^(2/, y") near (ys,0) can be written as: 



^(2/, v") = -a,r(a;,) [y-y, + {l- 7/3)2/"]" 



(C29) 



(where the factor (1 — 7/3) is imphed by (C8)) one gets 
for a; » X*, 1^ » 1 : 



H{x) ~ h{x) 

P{> A,< A)>P(< A,> A) 



(C30) 



This will also be the case for the Hamilton model consid- 
ered below. Similarly, if ({t) oc t~'^ for large r with k > 0, 
one obtains for z <C 1, ^ ^ 1 : 



G{z) ^ g(z) 

P{< A,> A)>P(> A,< A) 



C.2. Tree-model 



(C31) 



In the peculiar case of the tree-model (Schaeffer 1984, 
Bernardeau and Schaeffer 1992) , the many-body corre- 
lation functions veriiy: 



^„(ri,...,r„) 



(a) ta n-1 



(C32) 



where (a) is a particular tree topology connecting the 
(n — 1) points without making any loop, Qn"^ is a pa- 
rameter associated with the order of the correlation and 
the topology involved, ta is a particular labelling of the 
given topology (a) and the last product is made over the 
(n — 1) links between the n points with two-body correla- 
tion functions. Moreover, we assume that the parameters 
q1"^ are given by 



Qi"' = n 



, d.(a) 



Vi being a weight associated with a vertex of i lines and 
di{a) the number of such vertices in the topology (a), and 
we define the generating function 



C{t) = E with vo = = 1 



(C34) 



i=0 



In this case, one can show that x('^i'^')' that is 
—ipiUjy")/^, is given by the system 



r{r) = ^l d^r'pot^{r'nry)C{T{r')) 

V+V" 



X = - [ d' 
(. Jv+v" 



rponir) 



C(T(r)) - 



r(r)C'(T(r)) 



(C35) 



The equations for % = —'il'{y,y")/£, with /z(r) = 1 — A in 
volume V and 1 — A" in V" are then obtained from (C35) 
by taking the limit e — > at fixed, finite y": 

r{r) = -,/^^i(^C'(r(r'))-."^C'(r(i?)) 



^ = y I ^ 



C(T(r)) 



r(r)('(r(/-)) 



(C36) 



arm 



Tm\r(Ry) 



which are exact equations. 
C.2.1. Constant r approximation 

A very useful approximation, for a single cell {V" = 0) in 
the case where fi{r) is constant in the volume V, which 
turns out to be astonishingly accurate in the tree-model, 
is to use for the solution r(r) of (C35) a constant r. This 
leads to the equations 



T = -yC{r) 

My) = y[C(T) - tC'(t)/2] 



and 0'{y) = C(t) (C37) 



C.2.2. Hamilton model 

We shall now consider the more specific case of the Hamil- 
ton model: 



C(r) = l-T+- (C38) 
For a single cell, this leads to the exact solution 



f{y) = y 

Jv 



d^r d^r' / y ^ 
V 2| 



(C39) 



where the inverse is meant in the operator sense, within 
the cell V. In the constant r approximation we get 



i=2 



(C33) (^(y) 



y 

1 + 2//2 



(C40) 
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/+ioo 
-ioo 



dy y 
2-Ki l + y/2 



C.S. Non-constant density contrast (1 + A)(i?) 

As explained in the main text, Chapter 5, one is led to 
consider objects defined by a density threshold which is 

not necessarily constant, and given by a curve (1 -\-A){R). 



For a cell with an infinitesimally thin edge, wc have note -/3 the slope of the curve (1 + A){R), we have: 



the exact solution of (C36), after the elimination of r(i?) 
and ^{r,R) using (C8): 



dln{l + A) dlnx 



dlnR 



dlnR 



7-/3 



(C47) 



i>{y,y") = y:>{y)+ 



with 



y"[{l-a)^ + a^'{y)] 



Then, we define the mass fraction dM formed by objects 



^Tr[|(l 



(C41) of scale R- R + dRhy. 

M. 2.\ J- S. I 

13 Kj: dM = {1 + A)P(> A, < A') 



(a= (1-7/3) = 0.47 

\/3 = (1 - 7/3)(l - 7/4)(l - 7/6)/(l - 7/2) = 1-54 

where we have used 7 = 1.8. 

In the constant r approximation (C41) reduces to 



/? > 7 : dTW = (1 + A)P(< A, > A') 



(C48) 



where A = A{R), A' = A{R') and R' = R + dR, with 
dR > 0. 

We first consider the case /3 < 7. Then, in a fashion 
similar to what we did in App.C.1.2, we obtain: 



If ,M ^ y + y" + yy"{p-a)/2 
Vyy^ y ) l + y/2 + y"p/2 + yy"{(3 - a)/4 



We have 

My") > , 



y + {y-y")y"{l3-a)/2 



2m 



(C42) Pi> A, < AO = / ^-i,e«^+-)[^+(-'^/3)^"I-^>/? 

with Re(y") > along the integration path. As was the 
case for a constant density threshold, we can write in the 
regime where the scaling in x is valid: (1 + A)P(> A, < 
A') = xH{x)dx with: 



xH{x) 



1 + (y - 2/")/2 + 2/"/3/2 +{y- y")y"{(3 - a) /A 
A simple calculation shows that: 

2 



7-/3 



/+«oo 
-ioo 



dydy" ■0 
{2mY y"2 



„x[3/+(l-/3/3)y"] 



(C49) 



y- 

y 

and 
X - 



Q{y) 

00 : Q{y) 



3 1 

~ 7(1+17 



with Re(2/") > 0. Hence, the mass fraction can be writ- 
ten dM. = xH{x)dx as usual. In terms of the probability 
distribution P(A, 5") in the corona we obtain: 



/ 2;3-a-l ^ ^ 1 
\ 13-a ) 



(C43) ^H{x) = hm ^ / 



(l+A)(l-/3/3)-l 



d5" 



oo : ^?(a;) h{x) 

^ : ^(a;) ~ 1^ (fci)' /,(^) ^ i.07/.(a;) 
The normalization of the mass function is now: 
fi{M)dM = 0(0) 



(C44) 



X [{1 + A)il- 13/3) -il + d")]P{A, 6") 
Hence we obtain the general bounds: 
3 



7-/3 



1 



P <1 + S" > /A 

3 



1 + A 



^ Hjx) ^ 3-13 
- h{x) "7-/3 



(C50) 



(C51) 



where < 1 -\- 5" >/a is the mean value of (1 + 5") when 
there is a fixed density contrast A in the internal sphere 
V. For large x we have: 



(C45) 



which leads to: 

Jo ^ ' 7 v'4(/3 - a) + (/3 - 1)^ 



3-/3 

a; ^ X* : /i(x) < H{x) < ah{x) 

7-/3 



(C52) 



1.09 (C46) 



In the regime where the scaling in x for P{A) is valid, 
we obtained dM = (1 + A)P(> A, < A') = ef{x) where 
f{x) is a function of the sole variable x, and e = 3dR/R. 
If /? = 7, objects arc defined by a constant .t. which leads 
which is close to unity. Thus, the scaling functions h{x) within this framework to a mass fraction n{R)dR/R in 



and H{x) are very close. 



objects of scale R to R + dR: fJ,{R) = 3/(a;)=constant. 
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Hence we cannot define a mass function in the case (3 = 
7. The fact this value of (3 leads to some problems was 
already apparant in section 5.1. 

In the case /3 > 7, we obtain in a similar fashion: 

P(<A,>A')= I ^ ja+Aib+(i-g/3)y'l-^T/£ 

with Re(y") < 0, and we can write (1+A)P(< A, > A') = 
—xH{x)dx, with dx < 0. We can also express H{x) as: 

xH{x) = Jim / d6" 

i^oo P - 7 J(l+A)(l-/3/3)-l 

X [(1 + ,5") - (1 + A)(l - /3/3)]P(A, 6") (C53) 
If 7 < /? < 3, we obtain the bounds: 

<l + 5" >/A 



1 + A 
which lead to: 



^ /3 - 7 ^ < 1 + >/A 
- 3 h{x) - 1 + A 



37 3!/ ^ 



70; 



) - /3-7 



(C54) 



3 + 



1-u 



In the case /3 > 3, a simplification occurs because the 
lower bound of the integral in (C53) is smaller than —1. 
This is linked to the fact that for such a steep constraint 
the other probability P(> A, < A') is zero, because the 
average density of the halo cannot decrease faster than 
by conservation of the mass (in other words there are 
no negative densities). Then, (C53) reads: 



xH{x) = Jim 

^ — *oo 

so that 



H{x) = h{x) 



3e^(A) 



<l + 5">/A-(l + A)(l-|) 



< 1 



>/A 



/J-7 
which leads to 

a; a;* : H{x) = h{x) 
X <^ Xt, : H{x) = h{x) 



1 + A 



-1 + /J/3 



/3-7 



7C1J 



1 



(C55) 

(C56) 
(C57) 



We can note that for |/3| 00, which corresponds to 
objects defined by a constant radius R, we get H{x) = 
h{x). 

As we noticed in the main text, Chapter 5, the mass 
functions obtained above are not entirely satisfactory as 
there is no guarantee that the density of the "objects" 
picked out in this way decreases at larger radius, even 
locally at the considered scale i?, if /3 < or /? > 7. Hence 



we are led to include explicitely the additional constraint 
that the density profile is locally decreasing. 

Thus, in the case /? < wc can define the function 
H{x) such that the mass fraction of the objects of interest 
is dA4 = xH{x)dx by: 

/■oo /■Mm(A',5) 

xH{x)dx = (1 + A) y d5 j d6'P{6, 6') (C58) 

However, no general bounds can be obtained. 

In the case /? > 7, we define in a similar fashion: 



xH{x)dx = -{1 + A) dS dS'P{5,6') 

J A' J A' 

which leads to 

xH{x) 



(C59) 



3^ 



A 



Jim -p— / d5" 

i^oo P - 7 J(l+A)(l-,3/3)-l 



(C60) 



X [(l + 5")-(l + A)(l-/3/3)]P(A,5") 
If /? > 3 we obtain the upper bound: 



H{x) 



< 



h{x) ~ /3 - 7 [ 1 + A 
which gives 

x^ Xt, : II{x) < h{x) 
a; <C X* : H{x) < h{x) 



<l + 5" >/A 



/J-7 



7w 



1 - w 



(C61) 

(C62) 
(C63) 



If /3 ^ 00, wc obtain H{x)/h{x) = P{A,S" < A)/P(A). 
Hence H{x) < h{x). Moreover, for x » x* we have < 
l+f^" >/A= (l-7/3)(l+A) which implies that P(A,^" < 
A) > 7/3 P(A). Hence we get: 

/? ^ 00 : H{x) < h{x) , a; > a;* : H{x) > 7/3 /i(a;)(C64) 

Hence, when we add the constraint that the density 

profile is locally decreasing wc usually cannot get general 
bounds, but the mass functions still present the scaling in 

X. 

D. Evolution of an over(under)-density (Peebles 
1980) 

D.l. Spherical collapse for il = 1 

We consider the collapse of a sphere of mass M, radius r. 
The motion of the external shell is given by 



fr _ gu 

d^~ 

and a parametric solution of the dynamics is: 
'r = A{1 - cos9) 



t = B{e- sine) 



with = gUB^ 



(Dl) 



(D2) 
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The sphere will collapse at Ocoii = Stt, tcou = 2ttB. The 
density of the background universe is pb = l/(67rtji^), 
which translates into a non-perturbed radius = 
9/2 QMt^. The density contrast of the overdensity is 



1 + <5=P 
V r 



9 1 f t_ 

rJ " 2 (1 - cos6')3 \B 



(D3) 



corresponds to the contrast evaluated using the linear the- 
ory 

2/3 



3 (Qt 



At the time of collapse the latter is 

5co = 3/20 (127r)2/3 ~ 1.69 



(D4) 



(D5) 



If there is no kinetic energy at the maximum radius Rm = 
2A, the energy conservation and the virial theorem imply 
that the equilibrium radius is Ry = Rm/'2 = A, and if 
the object is virialized at the time tcoii it has a density 
contrast A^, at this instant given by: 



1 + Ac 



n{tcou) 

Ry 



IStt^ ~ 178 



(D6) 



If the overdensity collapsed at the redshift z, its present 
"linear" density contrast is thus: 



5c{z) = 6co (1 + z) (D7) 
The dynamics of an underdense region is given by: 
V = A{coshr] — 1) 

t = i?(sinh77 — rj) 
We have: 



with A^ = gMB'^ 



(D8) 



1 + 6= 

and 



1 



r J 2 (coshr; - 1)3 



2/3 



(D9) 



(DIO) 



D.2. Spherical collapse for Q < 1 , A = 

The dynamics of a spherical overdensity of mass M which 
will eventually collapse is the same as previously, but the 
evolution of the background universe is now given by: 



Vb = Ab{coshrib - 1) 

t = Bbismhrib - rib) 
One also has: 



with A? = QMBl 



(Dll) 



r?60 = cosh-i - 1^ and Bb = ^H^^Vlo{l - Qq)"^/^ 



The density contrast is then 
. 3 / cosh rjb — 1 



1 + 6 



Bb 
B 



\ 1 — cos 6 

The growing mode of the linear approximation is 
3 sinh r]b (sinh rjb - rjb) 



(D12) 



(coshr^b — 1)^ 



which is normalized so that D{t 
we have 



6L{t) ^-Dit) 



1+ 



Bb 
B 



2/3 



(D13) 

oo) = 1. At small times 
(D14) 



with 

m 



1 

10 



B'b) 



2/3 



At the time of collapse tcoii 
contrast is 



for t « Bb 

2ttB the linear density 



1 + 



f2-KBb 



2/3' 



V icoU 

and the density contrast of the virialized object is 

27r 



1 + Ac = (cosh 77bcoH - 1) 



(D15) 



(D16) 



,sinh776coH - rjbcoll. 

If fio = 0.1, their present values are : 6c — 1-62 and 
Ac ~ 978. If the overdensity collapsed at the redshift z, 
its present "linear" density contrast is thus: 



^c(^) = -D{to 



1 



(2TTBbV"^ 

V tiz) ) 



(D17) 

where t{z) is the age of the universe at the redshift z. 



However, in a low density universe some overdensities 
will never collapse. As can be seen from (D14), they are 
characterized by < 6L{t) < 3/2 D{t). In such a case, the 
dynamics of the overdensity is given by (D8). The density 

contrast is 



1 + ^ 



\3 / cosh 77b — 1 



\ cosh r] — 1 
and the linear density contrast is: 

2/3' 



6L{t) = -D{t) 



1- 



(D18) 



(D19) 



The dynamics of underdense regions characterized by 

6L{t) < is also given by (D8), hence the previous results 
(D18) and (D19) apply. Note that overdensities which 
will collapse satisfy 5L{t) > 3/2 D{t) and B > 0, while 
overdensities which will never collapse have < 5i^{t) < 
3/2 D{t) and B > Bb, finally underdensities are charac- 
terized by 6Lit) < and < B < Bb. 
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E. Evolution of the correlation function 

E.l. Relation of ^ with S 

Initially wc divide the early universe into cells of conioving 
size X, and we consider a simplified model where we have 
only two populations of cells: overdense ones with density 
1 + (5-1- and underdense ones with density 1 + 6^, siich that 
when a ^ we have 5^ = — 5_ = 6l, and the density 
contrast of these cells evolves with time according to the 
spherical model. In the early universe there are as many 
cells 6+ as cells At any time, separating the universe 
in cells of the same size r = ax, we have a fraction 77+ of 
overdense cells and a fraction r]- of underdense ones, with 



11+ + V- ==1 
7]+ 6+ + 1]^ 6^ ~ 

From this system we obtain: 
Thus, in the linear stage we have 



and ^ ~ ^2^, while in the non-linear regime we get 



(El) 



(E2) 



(E3) 



(E4) 



since 6- ~ —1. These two relations fit exactly into Pee- 
bles' qualitative picture. They show that the hierarchical 
clustering argument describes the evolution of the fluc- 
tuations in cells of a given size, which we have called ^ 
throughout this paper (and is called or in the linear 
case). As wc said above we take the time evolution of the 
average density contrasts (5+ and 6- to be the one given 
by the spherical collapse model. Of course, this implies 
another simplification since the radii of the cells (5+ and 
6- evolve in a different way with time, while (E2) is de- 
rived in the case where the universe is divided into cells of 
the same radius. Hence we consider that the density con- 
trast within underdense regions 6- is constant, so that wc 
get the previous results with a comoving cell radius xnl, 
entering ^, equal to the radius of the cells 6+. 

It is interesting to note that, even in this very sim- 
ple picture, we already find that, at late times, although 
overdense cells occupy a negligible fraction of the vol- 
ume: ry+ ~ and ry_ ~ 1, they contain all the mass: 
(1 -I- 6+)r]+ ~ 1 and (1 -|- (5_)r?_ ~ 0. 

E.2. Evolution o/J i/0 = 1 

From (D3) it can be seen that 5-^- is a fmiction of t/B, 
since 9 depends only on t/B from (D2). Hence it is a well- 
known function of ^i, = a Sq through (D4). Similarly, 



through (D8) and (D9) 6- is seen to be a function of the 
same ratio t/B, and using (DIO) of 5l- We thus get 



(E5) 



which defines a model for F , or at least explains the origin 
of the scaling for ^ as a function of = o^Sq. However, 
we cannot follow strictly the spherical collapse model for 
5+ as we have to take into account the virialization of the 
overdensity within a finite radius. To do so, we write 



1 + ^+ = 



1 



2 f{ef 



(E6) 



where f{9) = r/A is equal to (1 — cos0) as ^ 0, see (D3), 
and f{9) ~ 1 when 9 ^ ir, with = A the final virial- 
ization radius, following the usual model with no kinetic 
energy at the time of maximum expansion. We shall get 
the precise shape of f{9) from the numerical simulations 
performed by Jain et al.(1995). 

In a more explicit form, if we define the function ^(t) 

by 



't > : 5(r) 



with T = ^ — sin ( 



r < : 5(t) = T^/(cosh7/ — 1)^ with r = sinhry — rj 



we get 



9 / t 
9{- 



- 1 



9 / t 
9 



- 1 



(E7) 



2"V BJ Ji2^ \B, 
where r is the physical length at which we evaluate ^, and 



3/2 



t _ 1 {2{)5l 
B~6 

with 



5l = {1 + z)-' So [(1 + z)r[l + ^(r, z)]'^^ 



(E8) 



(E9) 



This defines the function F such that ^ — F{^]^), and gives 
an implicit equation for ^ once So(a;), the initial fiuctua- 

tion spectrum extrapolated up to z = by linear theory, 
is known. More precisely, we get for F(x) the expression: 



" 2^ 



3/2 



iff) 



1 /20 



3/2 



,3/4 



(ElO) 



and we recover the asymptotic behaviour of F{x): for 

small X we have F{x) ~ x while for large x we get 



F{x) ~ (10/3)= 



r-3/2 



In fact, as was shown by Jain et 



al.(1995) from numerical simulations, the dynamics is not 
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exactly given by the spherical model, and there is a correc- 
tion which depends on the index n of the power-spectrum. 
Thus we write, in a fashion similar to these authors: 



^ = Fn{U) with Fr,{x) 



a 



(Ell) 



which ensures that we still have ^ ~ if ^ 1 and 

— —3/2 — 

^ oc if ^ 1- The parameter a depends on n, 

and is such that the virialization radius is in this picture 
Rv = a A. Moreover, Jain ct al.(1995) considered the 
quantity ^, so we have to add a correction to get a for ^ 
which is the quantity we are interested in. If the two-body 
correlation function is a power-law ^(r) cx r~'^ we have: 



(E12) 



In the linear regime 7l = 3 + n while in the highly non- 
linear regime > 177) we have 7 = 3(3 + n)/{5 + n). 
This allows us to get the correction a (for ^) from the 
correction a (for ^) obtained from the fits given by Jain 
et al.(1995). These parameters are shown on table 2. 



Table 2. Parameters a and a for various power-spectra. 




-1 
-2 



1 

1.33 
1.25 



1.8 
1.5 

1 



1.34 
1.33 
1.25 



1.30 1.43 
1.24 1.18 
1.13 1.09 



Note that we consider in this model the evolution of 

— 1/2 

an ovcrdcnsity which is initially equal to = E, and 

we relate the evolution of the correlation function to the 
evolution of this particular density contrast. In fact, a 
more detailed treatment woiild be to consider the evo- 
lution of all initial density contrasts (distributed accord- 
ing to the usual gaussian) from the spherical model, and 
then take an average to get the fluctuations ^ produced 
by these overdensities, as in Munshi and Padmanabhan 
(1996). However, with such a model only half the mass is 
in overdensities, as in the PS approximation, contrary to 
our model. One usually cures this by multiplying the PS 
mass function, or in the present case the mass contained 
in any range of overdensities, by a factor of 2, so that all 
the mass is in overdense regions, and none in underdense 
ones. Nevertheless, this correction is rather artificial, in 
spite of the argument based on the result of the excursion 
sets formalism (that anyway holds only for the quite unre- 
alistic top- hat filter in k), and as we note in section 4.1 it is 
indeed insufficient to give acceptable results. Therefore, a 
detailed treatment of all initial density fiuctuations, with 
a final averaging, is probably unnecessary as it is based 



on a picture of the density field which is wrong at least 
by a factor of 2, and thus should not give a very reliable 
value of the normalization a. Thus, we feel that although 
the spherical collapse model contains parts of the rele- 
vant physical processes, and can be used to get averaged 
quantities like ^ within a correction factor close to unity, 
it should not be pushed too far to study the individual 
behaviour of all density fluctuations. 

From the explicit analytical flt for {x) given by Jain 
et al.(1995) we get directly fo{9). More precisely, we use 
the following form for fo{9) as a function of r = t/B, 
which in turn is related to 6 through (D2): 



(9/2) _ 3/10 (3/4)1/3^4/3 ^ (^/e)4 

1 + (r/6)4 



(E13) 



The terms in r^/^ and t^/^ correspond to the first and sec- 
ond order of (1 — cos 0) naO ^ {) and t ^ 0, which we need 
to recover 5+ ~ 5l as 5l 0, and the term in ensures 
that /o(t) ^ q;o as t ^ 00. Then we can derive F{x) and 
Fn{x) from ^0(2;), since we have: F{x) = a^^ F^i'StQ x) 
and Fn{x) = (a/cio)^ FQ[{a/aQ)~^ x], where the param- 
eter a which depends on the power-spectrum is given in 
table 1. 



log(0 




log(^L) 



Fig. 12. The correlation function ^ as a function of ^l, for 
n = —2 and n = 0. The solid line corresponds to our formu- 
lation (E7) or (ElO) while the dashed-line is the fit given by 
Jain et al.(1995). 



Fig. 12 shows the comparison for ^ between this ana- 
lytical formulation and the fits given by Jain et al.(1995) 
for the power-spectra n = —2 and n = 0. It is inter- 
esting to note on this figure that while the highly non- 
linear regimes are quite close for all spectra - indeed we 

— 3/2 

have ^ = (10/3a)^ the intermediate regime for 

1 < ^ < 100 shows wider variations. Note that the param- 
eterization (Ell) implies that the "virialized" regime is 
reached at a constant value of ^j^/a^, that is at 5l oc , 
or at a time t oc a^/^, for ^ oc a^. 

The advantage of our calculation, in addition to the 
fact that it enlightens the origin of the function F, is that 
it can readily be extended to the case f2 < 1 , A = 0, as 
we shall see now. 
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E.3. 17 < 1 , A = 

In the case of a low-density universe we still use (E2) but 
the time evolution of ^+ and 5- and their dependence on 
5l are different. Moreover, as we note in App.D.2 , some 
overdensities (5+ will never collapse. We first consider the 
case of an overdensity which will eventually collapse (large 
1l ■ ^Lit) > 3/2 D{t)). Using the relation 



(coshr/f, - 1)3 = 2 H-^l^^ (1 + z)" 
we get from (D12) 



(E14) 



1 + S+ 



where f{6) is the function we introduced above, see (E6), 

in the case f2 = 1 . Wc can note that Sj^^ is no longer a func- 
tion of the sole ratio t/B. Indeed a simplification occurs 
for = 1, where Qo{Hot)'^{l + z)^ = 4/9 is a constant, 
which leads to (E6). Next, to get the relation ^ 6 at a 

given redshift z, we have to infer B from 6l = ^fCh- This 
is done from (D14), which leads to 



B = Bh 



ML 



-3/2 



(E16) 



The dynamics of overdensities which will never collapsc\ 
and of underdensities (corresponding to S-), is described 
by (D8). Hence or S- is given by (D18), which leads to 



1 + = 



no{Hot)^il + z)^ (coshr;-l)3 \B 



(E17) 



where 77 is defined by (D8). Finally, B is given by (D19) 
which leads to 



B = Bh 



ML 



-3/2 



(EI8) 



Note that contrary to the case = 1 the cells 5+ and 5^ 
have a different parameter B. All these relations define a 
function F analogous to the case = 1 such that 



(E19) 



but F is now a function of two variables: and the red- 
shift z or expansion factor a. 
More explicitely, we can write 



^{r,z) = - 



2 g{t/B. 



Oo(7?ot)2(l + z)3 

2 9{t/B+) 

no{Hot)Hi + z)^ 



(E20) 



where r is the physical length at which we evaluate ^, 5 is 
the function we defined in the previous section in the case 
= 1, B+ and B- are given by 



'Sl > 3/2 D{t) : B+ = Bb [2/3 dL/D{t) - 1] 



- 11-3/2 



<3/2D{t) : B+ = 

and 

B_=-Bb [1 + 2/3 SL/D{t)]-^/^ 
Finally, 6l is obtained from 



(E21) 



B, [1-2/3 6L/D{t)]-^/^ 

(E22) 



(E15) Sl 



m 
m 



{l + z)r[l+ar,z)]'/' 



(E23) 



This defines the function F such that ^ = F{£^^,a), and 
gives an implicit equation for ^ once Tiq{x) is known. 

As was the case for O = 1, we must add a correc- 
tion which depends on the index n of the power-spectrum. 
Thus, in a fashion similar to what we did in the previous 
section we write: 

l = Fn(^L,a) with Fr,{x,a)=a'' F(^^,a) (E24) 

The parameter a, which measures the virialization radius 
of overdensities, is taken as independent of time. Since 

— > 1 when a — > 0, it must take the values given in table 

1 obtained for a critical universe. 



log(0 




log(ej 



Fig. 13. Evolution of the correlation function ^ as a function of 
with the redshift for n = — 2. The dashed-line corresponds 
to the case f2 = 1 (or ^; ^ 00), while the soUd lines correspond 
to fio = 0.3 for ^; = and z = 2. 



When f ^ 0, wc have 0^1, and relations (E20) and 

(E24) tend to the relations (E7) and (Ell) wc got in the 

previous section in the case of a critical universe, as we 

can see on Fig. 13. Moreover, in all cases we have ^ ~ 

as 1l 0, and when 1 > 200, 1{r,z) (x {I + z)"'^. We 

_ — 3/2 

can notice on the figure that ^oc^^ for » 10 what- 
ever the redshift even for Oq = 0.3. This is natural, since 
the overdensities corresponding to these large became 
non-linear in the early universe, when Q,{t) ~ 1, hence 
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they follow the same behaviour ^ oc as the one we 
got for a critical universe. The value of which marks 
the start of this dependence docs not vary very much with 
redshift because D{t oo) = 1, hence the correspond- 
ing to a given scale which turned non-linear in the early 
universe will not increase up to infinity but will remain 
finite as t ^ 00. This also implies that for a given over- 
density, or initial scale, we can see on the figure that while 
"saturates" and remains constant after some time, the 
corresponding value of ^ keeps increasing with time as 
^ (X {1 + z)~^ and moves further away from the fl = 1 
curve. More precisely, for a given ovcrdensity character- 
ized by its fixed parameter <C i?b, we have in the viri- 
alized regime: ^ ~ (5+ oc a while Sl oc D{t) B^^'^, 

hence we get: 



e cx {a/D{t)f ^ 



3/2 



(E25) 



This may be written as a function of the density parameter 



with 



2 [l-Ff24/7 + n/2] 



(E26) 



(E27) 



using the fitting formula of Carroll et al. (1992). Thus, 

by construction wc recover the behaviour noticed by Pea- 
cock and Dodds (1996) in numerical simulations. The fact 
that the latter show that all the redshift dependence is 
accounted for by the factor (i[f7(f)] confirms that the pa- 
rameter a should be constant, otherwise it would produce 
an additional redshift dependence. 
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